Method and system for classifying a question

ABSTRACT

A method, implemented on at least one computing device, each of which has at least one processor, storage, and a communication platform connected to a network for classifying a question is disclosed. A question is received from a person. A question pattern is determined. A model selected based on the question is retrieved. Further, a decision is made as to whether the question is a personal question based on the question pattern and the selected model.

CROSS-REFERENCE TO RELATED APPLICATION

The present application is a continuation of U.S. patent applicationSer. No. 14/874,926 filed Oct. 5, 2015 and is related to U.S. patentapplication Ser. No. 14/874,852 filed on Oct. 5, 2015, entitled METHODAND SYSTEM FOR PROVIDING SYNTHETIC ANSWERS TO A PERSONAL QUESTION, allof which are incorporated herein by reference in their entireties.

BACKGROUND 1. Technical Field

The present teaching generally relates to organizing, retrieving,presenting, and utilizing information. Specifically, the presentteaching relates to methods and systems for classifying a question intoa personal question or a non-personal question and providing syntheticanswers to a personal question.

2. Discussion of Technical Background

The Internet has made it possible for a person to electronically accessvirtually any content at any time and from any location. The Internettechnology facilitates information publishing, information sharing, anddata exchange in various spaces and among different persons. One problemassociated with the rapid growth of the Internet is the so-called“information explosion,” which is the rapid increase in the amount ofavailable information and the effects of this abundance. As the amountof available information grows, the problem of managing the informationbecomes more difficult, which can lead to information overload. With theexplosion of information, it has become more and more important toprovide users with information from a public space that is relevant tothe individual person and not just information in general.

In addition to the public space such as the Internet, semi-privatespaces including social media and data sharing sites have become anotherimportant source where people can obtain and share information in theirdaily lives. The continuous and rapid growth of social media and datasharing sites in the past decade has significantly impacted thelifestyles of many; people spend more and more time on chatting andsharing information with their social connections in the semi-privatespaces or use such semi-private sources as additional means forobtaining information and entertainment. Similar to what has happened inthe public space, information explosion has also become an issue in thesocial media space, especially in managing and retrieving information inan efficient and organized manner.

Private space is another data source used frequently in people'severyday lives. For example, personal emails in Yahoo! mail, Gmail,Outlook etc. and personal calendar events are considered as privatesources because they are only accessible to a person when she or he logsin using private credentials. Although most information in a person'sprivate space may be relevant to the person, it is organized in asegregated manner. For example, a person's emails may be organized bydifferent email accounts and stored locally in different emailapplications or remotely at different email servers. As such, to get afull picture of some situation related to, e.g., some event, a personoften has to search different private spaces to piece everythingtogether. For example, to check with a friend of the actual arrival timefor a dinner, one may have to first check a particular email (in theemail space) from the friend indicating the time the friend will arrive,and then go to Contacts (a different private space) to search for thefriend's contact information before making a call to the friend toconfirm the actual arrival time. This is not convenient.

The segregation of information occurs not only in the private space, butalso in the semi-private and public spaces. This has led to anotherconsequential problem given the information explosion: requiring one toconstantly look for information across different segregated spaces topiece everything together due to lack of meaningful connections amongpieces of information that are related in actuality yet isolated indifferent segregated spaces.

Efforts have been made to organize the huge amount of availableinformation to assist a person to find the relevant information.Conventional scheme of such effort is application-centric and/ordomain-centric. Each application carves out its own subset ofinformation in a manner that is specific to the application and/orspecific to a vertical or domain. For example, such attempt is eitherdedicated to a particular email account (e.g., www.Gmail.com) orspecific to an email vertical (e.g., Outlook); a traditional web topicalportal allows users to access information in a specific vertical, suchas www.IMDB.com in the movies domain and www.ESPN.com in the sportsdomain. In practice, however, a person often has to go back and forthbetween different applications, sometimes across different spaces, inorder to complete a task because of the segregated and unorganizednature of information existing in various spaces. Moreover, even withina specific vertical, the enormous amount of information makes it tediousand time consuming to find the desired information.

Another line of effort is directed to organizing and providinginformation in an interest-centric manner. For example, user groups ofsocial media in a semi-private space may be formed by common interestsamong the group members so that they can share information that islikely to be of interest to each other. Web portals in the public spacestart to build user profiles for individuals and recommend content basedon an individual person's interests, either declared or inferred. Theeffectiveness of interest-centric information organization andrecommendation is highly relied on the accuracy of user profiling.Oftentimes, however, a person may not like to declare her/his interests,whether in a semi-private space or a public space. In that case, theaccuracy of user profiling can only be relied on estimation, which canbe questionable. Accordingly, neither of the application-centric,domain-centric, and interest-centric ways works well in dealing with theinformation explosion challenge.

FIG. 1 depicts a traditional scheme of information organization andretrieval in different spaces in a segregated and disorganized manner. Aperson 102 has to interact with information in private space 104,semi-private space 106, and public space 108 via unrelated and separatemeans 110, 112, 114, respectively. For accessing private data from theprivate space 104, means 110, such as email applications, email sites,local or remote Contacts and calendars, etc., has to be selected andused. Each means 110 is domain or application-oriented, allowing theperson 102 to access information related to the domain with the specificapplication that the means 110 is developed for. Even for informationresiding within different applications/domains in the private space 104,a person 102 still has to go by different means 110 to access content ofeach application/domain, which is not convenient and not person-centric.For example, in order to find out the phone numbers of attendees of abirthday party, the person 102 has to first find all the confirmationemails from the attendees (may be sent in different emails and even todifferent email accounts), write down each name, and open differentContacts to look for their phone numbers.

Similarly, for interacting with the semi-private space 106, a person 102needs to use a variety of means 112, each of which is developed anddedicated for a specific semi-private data source. For example, Facebookdesktop application, Facebook mobile app, and Facebook site are allmeans for accessing information in the person 102's Facebook account.But when the person 102 wants to open any document shared on Dropbox bya Facebook friend, the person 102 has to switch to another meansdedicated to Dropbox (a desktop application, a mobile app, or awebsite). As shown in FIG. 1, information may be transmitted between theprivate space 104 and the semi-private space 106. For instance, privatephotos can be uploaded to a social media site for sharing with friends;social media or data sharing sites may send private emails to a person102's private email account notifying her/him of status updates ofsocial friends. However, such information exchange does notautomatically create any linkage between data between the private andsemi-private spaces 104, 106. Thus, there is no application that cankeep track of such information exchange and establish meaningfulconnections, much less utilizing the connections to make it easier tosearch for information.

As to the public space 108, means 114 such as traditional search engines(e.g., www.Google.com) or web portals (e.g., www.CNN.com, www.AOL.com,www.IMDB.com, etc.) are used to access information. With the increasingchallenge of information explosion, various efforts have been made toassist a person 102 to efficiently access relevant and on-the-pointcontent from the public space 108. For example, topical portals havebeen developed that are more domain-oriented as compared to genericcontent gathering systems such as traditional search engines. Examplesinclude topical portals on finance, sports, news, weather, shopping,music, art, movies, etc. Such topical portals allow the person 102 toaccess information related to subject matters that these portals aredirected to. Vertical search has also been implemented by major searchengines to help to limit the search results within a specific domain,such as images, news, or local results. However, even if limiting thesearch result to a specific domain in the public space 108, there isstill an enormous amount of available information, putting much burdenon the person 102 to identify desired information.

There is also information flow among the public space 108, thesemi-private space 106, and the private space 104. For example,www.FedeEx.com (public space) may send a private email to a person 102'semail account (private space) with a tracking number; a person 102 mayinclude URLs of public websites in her/his tweets to followers. However,in reality, it is easy to lose track of related information residing indifferent spaces. When needed, much effort is needed to dig them outbased on memory via separate means 110, 112, 114 across different spaces104, 106, 108. In today's society, this consumes more and more people'stime.

Because information residing in different spaces or even within the samespace is organized in a segregated manner and can only be accessed viadedicated means, the identification and presentation of information fromdifferent sources (whether from the same or different spaces) cannot bemade in a coherent and unified manner. For example, when a person 102searches for information using a query in different spaces, the resultsyielded in different search spaces are different. For instance, searchresult from a conventional search engine directed to the public space108 is usually a search result page with “blue links,” while a search inthe email space based on the same query will certainly look completelydifferent. When the same query is used for search in different socialmedia applications in the semi-private space 106, each application willagain likely organize and present the search result in a distinctmanner. Such inconsistency affects user experience. Further, relatedinformation residing in different sources is retrieved piece meal sothat it requires the person 102 to manually connect the dots provide amental picture of the overall situation.

Therefore, there is a need for improvements over the conventionalapproaches to organize, retrieve, present, and utilize information.

SUMMARY

The present teaching relates to methods, systems, and programming forclassifying a question and providing synthetic answers to a personalquestion.

In one embodiment, a method, implemented on at least one computingdevice, each of which has at least one processor, storage, and acommunication platform connected to a network for classifying a questionis disclosed. A question is received from a person. A question patternis determined. A model selected based on the question is retrieved.Further, a decision is made as to whether the question is a personalquestion based on the question pattern and the selected model.

In some embodiments, the question pattern comprises at least aninterrogative word extracted from the question.

In some embodiments, the selected model comprises one or more personalquestion patterns generated based on information associated with aplurality of previously asked questions.

In some embodiments, the one or more personal question patterns aretrained based on user interactions associated with a plurality ofanswers to the plurality of previously asked questions.

In some embodiments, the one or more personal question patterns aretrained based on user interactions associated with a plurality ofanswers to the plurality of previously asked questions.

In some embodiments, the question pattern comprises at least an entityextracted from the question.

In some embodiments, the selected model is trained using a machinelearning system.

In some embodiments, the method further comprises searching one or moreanswers from a person-centric knowledge database if it is determinedthat the question is a personal question.

In another embodiment, a system having at least one processor, storage,and a communication platform connected to a network for classifying aquestion, is disclosed. The system comprises a user interfaceimplemented on the at least one processor and configured to receive aquestion from a person; a question pattern extractor implemented on theat least one processor and configured to determine a question patternfrom the question; a classification knowledge retriever implemented onthe at least one processor and configured to retrieve a model selectedbased on the question; and a classification decision implemented on theat least one processor and configured to determine whether the questionpattern is a personal question.

In yet another embodiment, a non-transitory machine-readable mediumhaving information recorded thereon for classifying a question, isdisclosed. The information, when read by the machine, causes the machineto perform receiving a question from a user; determining a questionpattern from the question; retrieving a model selected based on thequestion; and determining whether the question pattern is a personalquestion.

BRIEF DESCRIPTION OF THE DRAWINGS

The methods, systems, and/or programming described herein are furtherdescribed in terms of exemplary embodiments. These exemplary embodimentsare described in detail with reference to the drawings. Theseembodiments are non-limiting exemplary embodiments, in which likereference numerals represent similar structures throughout the severalviews of the drawings, and wherein:

FIG. 1 (prior art) depicts a traditional scheme of informationorganization and retrieval from different spaces in a segregated anddisorganized manner;

FIG. 2 depicts a novel scheme of building a person-centric space for aperson by cross-linking data from different spaces and applicationsthereof, according to an embodiment of the present teaching;

FIG. 3 illustrates exemplary types of data sources in a private space;

FIG. 4 illustrates exemplary types of data sources in a semi-privatespace;

FIG. 5 depicts an exemplary system diagram of a person-centric INDEXsystem, according to an embodiment of the present teaching;

FIG. 6 is a flowchart of an exemplary process for building aperson-centric space, according to an embodiment of the presentteaching;

FIG. 7 is a flowchart of an exemplary process for applying aperson-centric space for digital personal assistance, according to anembodiment of the present teaching;

FIG. 8 depicts an exemplary scheme of building a person-centric spacefor each individual person via a person-centric INDEX system andapplications thereof, according to an embodiment of the presentteaching;

FIG. 9 depicts an exemplary scheme in which a variety of dynamic cardsare built and provided to a person based on different intents estimatedfor the same query in different contexts, according to an embodiment ofthe present teaching;

FIG. 10 illustrates an exemplary answer card, according to an embodimentof the present teaching;

FIG. 11 illustrates an exemplary search results card, according to anembodiment of the present teaching;

FIG. 12 depicts an exemplary scheme of automatic online order emailsummary and package tracking via cross-linked data in a person-centricspace, according to an embodiment of the present teaching;

FIG. 13 illustrates an exemplary task with a list of task actions forautomatic package tracking;

FIG. 14 illustrates a series of exemplary cards provided to a person inthe process of automatic online order email summary and packagetracking;

FIG. 15 illustrates exemplary entities extracted from a person-centricspace and their relationships established in the process of automaticonline order email summary and package tracking;

FIG. 16 depicts an exemplary system diagram for question classificationand providing synthetic answers to a personal question, according to anembodiment of the present teaching;

FIG. 17 depicts a detailed system diagram for question classificationand providing synthetic answers to a personal question, according toanother embodiment of the present teaching;

FIG. 18 is a flowchart of an exemplary process for questionclassification and providing synthetic answers to a personal question,according to an embodiment of the present teaching;

FIG. 19 depicts an exemplary system diagram of a question classifier,according to an embodiment of the present teaching;

FIG. 20 is a flowchart of an exemplary process of classifying aquestion, according to an embodiment of the present teaching;

FIG. 21 depicts an exemplary person-centric knowledge for classifying aquestion, according to an embodiment of the present teaching;

FIG. 22 is a flowchart of an exemplary process of classifying aquestion, according to another embodiment of the present teaching;

FIG. 23 depicts an exemplary person-centric knowledge for classifying aquestion, according to another embodiment of the present teaching;

FIG. 24 depicts an exemplary system diagram of a machine learning systemfor training the person-centric knowledge for classifying a question,according to an embodiment of the present teaching;

FIG. 25 is a flowchart of an exemplary process of a machine learningprocess for training the person-centric knowledge for classifying aquestion, according to an embodiment of the present teaching;

FIG. 26 is a flowchart of an exemplary process of a machine learningprocess for training the person-centric knowledge for classifying aquestion, according to another embodiment of the present teaching;

FIG. 27 depicts an example of determining the classification accuracybased on two parameters, according to another embodiment of the presentteaching;

FIG. 28 depicts an exemplary system diagram for providing syntheticanswers to a personal question, according to an embodiment of thepresent teaching;

FIG. 29 depicts an exemplary system diagram for providing syntheticanswers to a personal question, according to another embodiment of thepresent teaching;

FIG. 30 depicts an exemplary system diagram for providing syntheticanswers to a personal question, according to another embodiment of thepresent teaching;

FIG. 31 depicts an exemplary system diagram for generating syntheticanswers to a personal question, according to an embodiment of thepresent teaching;

FIG. 32 is a flowchart of an exemplary process for generating syntheticanswers to a personal question, according to an embodiment of thepresent teaching;

FIG. 33 depicts an example of generating person-centric knowledge,according to an embodiment of the present teaching;

FIG. 34 depicts exemplary person-centric knowledge for providingsynthetic answers to a personal question, according to an embodiment ofthe present teaching;

FIG. 35 depicts an exemplary person-centric knowledge for providingsynthetic answers to a personal question, according to anotherembodiment of the present teaching;

FIG. 36 depicts an exemplary person-centric knowledge for providingsynthetic answers to a personal question, according to anotherembodiment of the present teaching;

FIG. 37 depicts an example of obtaining synthetic answers to a personalquestion based on the person-centric knowledge, according to anembodiment of the present teaching;

FIG. 38 depicts a general mobile device architecture on which thepresent teaching can be implemented; and

FIG. 39 depicts a general computer architecture on which the presentteaching can be implemented.

DETAILED DESCRIPTION

In the following detailed description, numerous specific details are setforth by way of examples in order to provide a thorough understanding ofthe relevant teachings. However, it should be apparent to those skilledin the art that the present teachings may be practiced without suchdetails. In other instances, well known methods, procedures, components,and/or circuitry have been described at a relatively high-level, withoutdetail, in order to avoid unnecessarily obscuring aspects of the presentteachings.

The present teaching describes methods, systems, and programming aspectsof efficiently and effectively organizing, retrieving, presenting, andutilizing information.

FIG. 2 depicts a novel scheme of building a person-centric space 200 fora person 102 by cross-linking data from different spaces andapplications thereof, according to an embodiment of the presentteaching. Unlike the traditional approach to organize information indifferent spaces in a segregated and disorganized manner, as illustratedin FIG. 1, FIG. 2 provides a person-centric INDEX system 202, whichbuilds the person-centric space 200 specific to the person 102 bydigesting information from the public space 108, semi-private space 106,and private space 104 and cross-linking relevant data from those spaces104, 106, 108. As described herein, a person 102 referred herein mayinclude a human being, a group of people, an organization such as abusiness department or a corporation, or any unit that can use theperson-centric INDEX system 202. A space, whether private, semi-private,or public, may be a collection of information in one or more sources.Through the person-centric INDEX system 202, information relevant to theperson 102 from each of the private, semi-private, and public spaces104, 106, and 108 is projected, into the person-centric space 200 in ameaningful manner. That is, a part of the data in the person-centricspace 200 is projected from the public space 108 in a manner relevant tothe person 102; a part of the data in the person-centric space 200 isprojected from the semi-private space 106 in a manner relevant to theperson 102; a part of the data in the person-centric space 200 isprojected from the private space 104. Thus, the person-centric space 200is an information universe meaningful to the person 102 and formed fromthe perspective of the person 102.

Different from conventional approaches, which organize information in anapplication-centric, domain-centric, or interest-centric manner, theperson-centric INDEX system 202 recognizes relevant information from theenormous information available in the public space 108, semi-privatespace 106, and private space 104 in accordance with the perspective ofthe person 102, thereby filtering out information that is not relevantto the person 102, assisting the person 102 to make sense out of therelevance among different pieces of information in the person-centricspace. The person-centric space 200 is dynamic and changes with theonline (possibly offline) activities of the person 102. For example, theperson 102 can search more content via the person-centric INDEX system202 (this function may be similar to conventional search engine) thatwill lead to the continuously expansion of the person-centric space 200.The person-centric INDEX system 202 can cross-link data acrossinformation different spaces, or information from different sources inthe same space. For instance, by identifying a FedEx tracking number inan order confirmation email sent to a personal email account fromwww.Amazon.com, the person-centric INDEX system 202 can automaticallysearch for any information in any space that is relevant to the trackingnumber, such as package delivery status information from www.FedEx.comin the public space 108. Although most information from www.FedEx.commay not be related to the person 102, the particular package deliverystatus information relevant to the person 102 and can be retrieved bythe person-centric INDEX system 202 and indexed against the informationfrom the person 102's private emails. In other words, the packagedelivery status information, even though from the public space 108, canbe projected into the person-centric space 200 and, together with otherinformation in the person-centric space 200 (such as a confirmationemail related to the package), the person-centric INDEX system 202integrates relevant information from different sources to yield unifiedand semantically meaningful information, such as a card related to anorder incorporating the name of the ordered item, the name of the personwho ordered it, the name of the company that is to deliver the item, aswell as the current delivery status.

In another example, when a private email reminding of an upcoming soccergame from a coach is received, the person-centric INDEX system 202 maybe triggered to process the private email and identify, based on thecontent of the email, certain information in the sports domain such asdate/time, location, and players and coaches of the soccer game andcross link the email with such information. The person-centric INDEXsystem 202 may also retrieve additional relevant information from otherdata sources, such as phone number of the coach from Contacts of theperson 102. The person-centric INDEX system 202 may also retrieve mapand directions to the soccer game stadium from Google Maps based on thelocation information and retrieve weather forecast of the game fromwww.Weather.com based on the date. If the coach is connected with theperson 102 in any social media, then the person-centric INDEX system 202may go to the social media site in the semi-private space 106 toretrieve any content made by the coach that is relevant to the soccergame. In this example, all those different pieces of information fromthe public space 108, semi-private space 106, and private space 104 arecross-linked and projected to the person-centric space 200 in accordancewith the person 102's perspective on the soccer game.

The person-centric INDEX system 202 may build the initial person-centricspace 200 when the person 102 first time accesses the person-centricINDEX system 202. By analyzing all the information in the private space104 which the person 102 has granted access permission, theperson-centric INDEX system 202 can identify, retrieve, and linkrelevant information from the public space 108, semi-private space 106,and private space 104 and project them into the person-centric space200. As mentioned above, the person-centric INDEX system 202 alsomaintains and updates the person-centric space 200 in a continuous ordynamic manner. In one example, the person-centric INDEX system 202 mayautomatically check any change, either in the private space 104 orotherwise, based on a schedule and initiates the update of theperson-centric space 200 when necessary. For example, every two hours,the person-centric INDEX system 202 may automatically check any newemail that has not been analyzed before. In another example, theperson-centric INDEX system 202 may automatically check any changeoccurring in the public space 108 and the semi-private space 106 that isrelevant to the person 102. For instance, in the soccer game exampledescribed above, every day before the scheduled soccer game, theperson-centric INDEX system 202 may automatically check www.Weather.comto see if the weather forecast needs to be updated. The person-centricINDEX system 202 may also update the person-centric space 200 responsiveto some triggering event that may affect any data in the person-centricspace 200. For example, in the FedEx package example described above,once the scheduled delivery date has passed or a package delivery emailhas been received, the person-centric INDEX system 202 may update theperson-centric space 200 to remove the temporary relationship betweenthe person 102 and www.FedEx.com until a new connection between them isestablished again in the future. The triggering event is not limited toevents happening in the public space 108, semi-private space 106, orprivate space 104, but can include any internal operation of theperson-centric INDEX system 202. As an example, every time theperson-centric INDEX system 202 performs a search in response to a queryor to answer a question, it may also trigger the person-centric INDEXsystem 202 to update the person-centric space 200 based on, e.g., newlyretrieved information related to, e.g., a search result or some answers.When the search result or answers cannot be found in the person-centricspace 200, the person-centric INDEX system 202 may also update theperson-centric space 200 to include those search results and answers.That is, the person-centric INDEX system 202 may dynamically update theperson-centric space 200 in response to any suitable triggering events.

To better understand information in the person-centric space 200 andmake it meaningful, the person-centric INDEX system 202 may furtherbuild a person-centric knowledge database including person-centricknowledge by extracting and associating data about the person 102 fromthe person-centric space 200. The person-centric INDEX system 202 canextract entities related to the person 102 and infer relationshipsbetween the entities without the person 102's explicit declaration. Aperson-centric knowledge representation for the person 102 can becreated by person-centric INDEX system 202 the based on the entities andrelationships. The inference can be based on any information in theperson-centric space 200. The knowledge elements that can be inferred ordeduced may include the person 102's social contacts, the person 102'srelationships with places, events, etc.

In order to construct the person-centric knowledge representation, theperson-centric INDEX system 202 may extract entities from content in theperson 102's person-centric space 200. These entities can be places likerestaurants or places of interest, contact mentions like names, emails,phone numbers or addresses, and events with date, place and personsinvolved. In addition to extracting these mentions, the person-centricINDEX system 202 can resolve them to what they refer to (i.e. candisambiguate an extracted entity when it may refer to multipleindividuals). For example, a word “King” in a private email may refer toa title of a person who is the King of a country or refer to a person'slast name. The person-centric INDEX system 202 may utilize anyinformation in the person-centric space 200 to determine what type ofentity the word “King” refers to in the email. In addition todetermining an entity type for an extracted entity name, theperson-centric INDEX system 202 may also determine a specific individualreferred to by this entity name. As one instance, a person's first namemay refer to different contacts, and a same restaurant name can refer toseveral restaurants. The person-centric INDEX system 202 can make use ofcontextual information and/or textual metadata associated with theentity name in the email to disambiguate such cases, thereby providing ahigh precision resolution. With the precise disambiguation, theperson-centric INDEX system 202 can find right information fromunstructured personal data and provide it in a structured way (e.g. in agraph associated with the person 102). In contrast to a conventionalpersonal profile, the person-centric INDEX system 202 generates a singlepersonal graph for an individual to encompass connections, interests,and events associated with the person 102. It can be understood that aperson-centric knowledge may also be represented in a format other thana graph.

The person-centric INDEX system 202, in conjunction with theperson-centric space 200, may organize related information fromdifferent sources and provide the information to a person 102 in auser-friendly, unified presentation style. In addition to providingrequested information in any known format, such as hyperlinks on asearch results page, the person-centric INDEX system 202 may presentinformation in intent-based cards. Unlike existing entity-based searchresults cards organizing results based on an entity, the person-centricINDEX system 202 may focus on a person 102's intent to dynamically builda card for the person 102. The intent may be explicitly specified in thequery, or estimated based on the context, trending events, or anyknowledge derived from the person-centric space 200. Knowing the person102's intent when the card is created to answer the query, theperson-centric INDEX system 202 can provide relevant information on thecard. The relevant information may include partial informationassociated with the entity in the query, and/or additional informationfrom the person-centric space 200 that is related to the person'sintent. In the soccer game example described above, in response to theperson's query or question related to the soccer game, theperson-centric INDEX system 202 may estimate the person's intent is toknow the date/time of the game and thus, build a card that includes notonly the direct answer of the date/time but also other informationrelated to the soccer game in the person-centric space 200, such as themap and directions, weather forecast, and contact information of thecoach.

In one embodiment, knowing the current intent of the person 102, theperson-centric INDEX system 202 can anticipate the next intent of theperson 102, such that the current card provided by the person-centricINDEX system 202 can lead to next steps. For example, the person-centricINDEX system 202 can anticipate that after looking at the show times ofa new movie, the person 102 will be likely to buy tickets. In anotherembodiment, focusing on the person 102's intent, the person-centricINDEX system 202 can answer the person 102 with a card even when thereis no entity in the query or request (i.e., in a query-less oranticipatory use case). For example, if the person-centric INDEX system202 determines that the person 102 has a behavior pattern of searchingtraffic information from work place to home at 5 pm on workdays, thenfrom now on, the person-centric INDEX system 202 may automaticallygenerate and provide a notice card to the person 102 at around 5 pm onevery workday, to notify the person 102 about the traffic informationregardless whether a query is received from the person 102.

The person-centric INDEX system 202 can be used for both building theperson-centric space 200 for a person 102 and facilitating the person102 to apply the person-centric space 200 in a variety for applications.Instead of using different means 110, 112, 114 shown in FIG. 1 to accessdifferent data sources across different spaces, the person-centric INDEXsystem 202 can serve as a centralized interface between the person 102and her/his own person-centric space 200, thereby reducing the time andefforts spent by the person 102 on retrieving desired information or anyother applications. As different pieces of relevant information from thepublic space 108, semi-private space 106, and private space 104 havebeen projected to the person-centric space 200 in a well-organized way,they can be handled by a single person-centric INDEX system 202, therebyimproving the efficiency and effectiveness in finding the desiredinformation. For example, in the FedEx package example described above,any time the person wants to know the current status of the package,she/he no longer needs to dig out the email with the tracking number,write down the tracking number, and open www.FedEx.com in a browser andtype in the tracking number. The person-centric INDEX system 202 mayhave already stored the package status information since the time whenthe initial order email was received and have kept updating the packagestatus information in the person-centric space 200. So any time when theperson 102 inputs a request for package status update, either in theform of a search query or a question n, the person-centric INDEX system202 can go directly to retrieve the updated package status informationfrom the person-centric space 200 or automatically call the trackingapplication programing interface (API) of FedEx server with the storedtracking number for the current status update. The result is thenprovided to the person 102 without any additional efforts made by theperson 102. In some embodiments, the person 102 may not even need toexplicitly request the status update. Responsive to receiving the orderconfirmation email, the person-centric INDEX system 202 mayautomatically set up a task to regularly send the status update to theperson 102 until the package is delivered or may dynamically notify theperson 102 with any event, like if the package is delayed or lost.

In one aspect of the present teaching, the person-centric INDEX system202, in conjunction with the person-centric space 200, can be used foranswering questions. To achieve this, the person-centric INDEX system202 may classify a question from a person 102 into a personal questionor a non-personal question. In some embodiment, data from theperson-centric space 200 may be for classification. For example, aquestion related to “uncle Sam” may be classified as a personal questionif “uncle Sam” is a real person identified from the private contactlist. Once the question is classified as personal, the person-centricINDEX system 202 may extract various features including entities andrelationships from the question. The extracted entities andrelationships may be used by the person-centric INDEX system 202 totraverse a person-centric knowledge database derived from theperson-centric space 200. In some embodiments, the person-centricknowledge database may store data in a triple format including one ormore entities and a relationship between the one or more entities. Whenan exact match of relationship and entity are found, an answer isreturned. When there is no exact match, a similarity between thequestion and answer triples is taken into consideration and used to findthe candidate answers. In the “uncle Sam” example described above, ifthe question is “where is uncle Sam,” the person-centric INDEX system202 may search the person-centric knowledge database for any locationentity that has a valid relationship with the entity “uncle Sam.” In oneexample, a recent email may be sent by “uncle Sam,” and the email mayalso mention that he will be attending a conference on these days. Thelocation of the conference can be retrieved from the conference websitein the public space 108, stored in the person-centric space 200, andassociated with entity “uncle Sam.” Based on the relationship, theperson-centric INDEX system 202 can answer the question with thelocation of the conference. The person-centric INDEX system 202 thusprovides an efficient solution to search for answers to personalquestions and increases user engagement and content understanding.

In another aspect of the present teaching, the person-centric INDEXsystem 202, in conjunction with the person-centric space 200, can beused for task completion. Task completion often involves interactionswith different data sources across different spaces. A task such as“making mother's day dinner reservation” involves task actions such asidentifying who is my mother, checking what date is mother's day thisyear, finding out a mutually available time slot on mother's day for mymother and me, picking up a restaurant that my mother and I like, makingan online reservation on the restaurant's website, etc. Traditionally,in order to complete each task action, a person 102 has to open a numberof applications to access information from different sources acrossdifferent spaces and perform a series of tedious operations, such assearching for “mother's day 2015” in a search engine, checking my owncalendar and mother's shared calendar, digging out past emails about therestaurant reservations for dinners with my mother, making onlinereservation via a browser, etc. In contrast to the traditionalapproaches for task completion, the person-centric INDEX system 202 cancomplete the same task more efficiently and effectively because allpieces of information related to mother's day dinner reservation havealready been projected to the person-centric space 200. This makesautomatic task generation and completion using the person-centric INDEXsystem 202 become possible. In response to receiving an input of “makingmother's day dinner reservation” from a person 102, the person-centricINDEX system 202 can automatically generate the list of task actions asmentioned above and execute each of them based on information from theperson-centric space 200 and update the person 102 with the currentstatus of completing the task.

With the dynamic and rich information related to the person 102 that isavailable in the person-centric space 200, the person-centric INDEXsystem 202 can even automatically generate a task without any input fromthe person 102. In one embodiment, anytime a card is generated andprovided to the person 102, the information on the card may be analyzedby the person-centric INDEX system 202 to determine whether a task needsto be generated as a follow-up of the card. For example, once an emailcard summarizing an online order is constructed, the person-centricINDEX system 202 may generate a task to track the package status untilit is delivered and notify any status update for the person 102. Inanother embodiment, any event occurring in the public space 108,semi-private space 106, or private space 104 that is relevant to theperson 102 may trigger the task completion as well. For instance, aflight delay message on an airline website in the public space 108 maytrigger generation of a task for changing hotel, rental car, andrestaurant reservations in the same trip. In still another embodiment,the person 102's past behavior patterns may help the person-centricINDEX system 202 to anticipate her/his intent in the similar context andautomatically generate a task accordingly. As an instance, if the person102 always had a dinner with her/his mother on mother's day at the samerestaurant, a task may be generated by the person-centric INDEX system202 this year, in advance, to make the mother's day dinner reservationat the same restaurant.

It is understood that in some occasions, certain task actions may not becompleted solely based on information from the person-centric space 200.For example, in order to complete the task “sending flowers to mom onmother's day,” flower shops need to be reached out to. In one embodimentof the present teaching, a task exchange platform may be created tofacilitate the completion of tasks. The person-centric INDEX system 202may send certain tasks or task actions to the task exchange platform sothat parties interested in completing the task may make bids on it. Thetask exchange platform alone, or in conjunction with the person-centricINDEX system 202, may select the winning bid and update the person 102with the current status of task completion. Monetization of taskcompletion may be achieved by charging service fee to the winning partyand/or the person 102 who requests the task.

In still another aspect of the present teaching, the person-centricINDEX system 202, in conjunction with the person-centric space 200, canbe used for query suggestions. By processing and analyzing data from theperson-centric space 200, the person-centric INDEX system 202 may builda user corpus database, which provides suggestions based on informationfrom the private space 104 and/or semi-private space 106. In response toany input from a person 102, the person-centric INDEX system 202 mayprocess the input and provide suggestions to the person 102 at runtimebased on the person 102's relevant private and/or semi-private data fromthe user corpus database as well other general log-based querysuggestion database and search history database. The query suggestionsmay be provided to the person 102 with very low latency (e.g., less than10 ms) in response to the person 102's initial input. Further, in someembodiments, before presenting to the person 102, suggestions generatedusing the person 102's private and/or semi-private data from the usercorpus database may be blended with suggestions produced based ongeneral log-based query suggestion database and search history database.Such blended suggestions may be filtered and ranked based on variousfactors, such as type of content suggested (e.g., email, social mediainformation, etc.), estimated intent based on an immediate previousinput from the person 102, context (e.g., location, data/time, etc.)related to the person 102, and/or other factors.

FIG. 3 illustrates exemplary types of data sources in a private space.The private space of a person may include any data source that isprivate to the person. For example, the private space may include anydata source that requires access information of the person (e.g.,password, token, biometric information, or any user credentials). Theprivate space may also include any data source that is intended to beaccessed only by the person even without requiring access control, suchas data on a person's smart phone that does not require password orfinger print verification. In this illustration, the private spaceincludes several categories of data sources such as emails, contacts,calendars, instant messaging, photos, usage records, bookmarks, etc.Emails include emails stored in remote email servers such as Yahoo!Mail, Gmail, Hotmail, etc. and local emails in an email application on apersonal computer or mobile device. Instant messaging includes anymessages communicated between the person 102 and others via any instantmessaging applications, for example, Yahoo! Messenger, WhatsApp,Snapchat, to name a few. Usage records may be any logs private to theperson, such as, but not limited to, browsing history and call records.It is understood that the examples described above are for illustrativepurpose and are not intended to be limiting.

FIG. 4 illustrates exemplary types of data sources in a semi-privatespace. The semi-private space of a person may include any data sourcethat is accessible for a group of people designated by the person. Oneexample of data sources in the semi-private space is social media, suchas Tumblr, Facebook, Twitter, LinkedIn, etc. A person can designate agroup of people who can access her/his information shared in the socialmedia sites, such as status updates, posts, photos, and comments.Another example of data sources in the semi-private space is a contentsharing site. For instance, a person can share photos with family andfriends at Flickr, share work documents with colleagues or classmates atGoogle Docs, and share any files at Dropbox. It is understood that insome cases, there is not a clear boundary between a data source in theprivate space and a data source in the semi-private space. For instance,if a person restricts photos at Flickr to be only accessible byher/himself, then Flickr becomes a private source of the person, justlike local photos stored on the person's device. Similarly, when theentire or a portion of a calendar is shared with others, the calendarbecomes part of the semi-private space. It is understood that theexamples described above are for illustrative purpose and are notintended to be limiting.

FIG. 5 depicts an exemplary system diagram of the person-centric INDEXsystem 202, according to an embodiment of the present teaching. Theperson-centric INDEX system 202 includes a user interface 502 thatconnects a person 102 with multiple front-end components including asuggestion engine 504, a query interface 506, a Q/A interface 508, atask interface 510, and a contextual information identifier 512 coupledwith a user database 514. To support the front-end components, theperson-centric INDEX system 202 further includes multiple functionalcomponents including a search engine 516, a Q/A engine 518, a taskgeneration engine 520, a task completion engine 522, an intent engine524, a person-centric knowledge retriever 526, and a dynamic cardbuilder 528. In the back-end, the person-centric INDEX system 202includes a variety of databases for storing information in differentforms for different purposes, such as the person-centric space 200having a public database 544, a semi-private database 546, and a privatedatabase 548. The person-centric space 200 in this embodiment is builtup by a cross-linking engine 542. The person-centric INDEX system 202further includes a knowledge engine 530 for building a person-centricknowledge database 532 by processing and analyzing information in theperson-centric space 200. In addition, additional types of analyticresults from the knowledge engine 530 based on data from theperson-centric space 200 and/or any other suitable data sources may bestored in an intent database 534, a card module database 536, and a tasktemplate database 538.

A person 102 may interact with the person-centric INDEX system 202 viathe user interface 502 by providing an input. The input may be made by,for example, typing in a query, question, or task request, or clickingor touching any user interface element in the user interface 502 toenter a query, question, or task request. With each input from theperson 102, the suggestion engine 504 provides a list of suggestions tofacilitate the person 102 to complete the entire input. In thisembodiment, the suggestion engine 504 may provide suggestions based onthe person's private and/or semi-private information retrieved by theperson-centric knowledge retriever 526 from the person-centric space 200and/or the person-centric knowledge database 532. Those suggestionsinclude, for example, a contact name from the private contact list, partof a tweet from Twitter, or a package tracking status stored in theperson-centric space 200. In some embodiments, the suggestion engine 504may blend those suggestions based on the person 102's private and/orsemi-private information with the conventional suggestions based onpopular query logs and search history. In this embodiment, the intentengine 524 may provide an estimated intent associated with each input tohelp filtering and/or ranking the suggestions provided to the person102.

Each of the query interface 506, Q/A interface 508, and task interface510 is configured to receive a particular type of user inputs andforward them to the respective engine for handling. Once the results arereturned from the respective engine and/or from the dynamic card builder528, each of the query interface 506, Q/A interface 508, and taskinterface 510 forwards the results to the user interface 502 forpresentation. In one embodiment, the user interface 502 may firstdetermine the specific type of each input and then dispatch it to thecorresponding interface. For example, the user interface 502 mayidentify that an input is a question based on semantic analysis orkeyword matching (e.g., looking for keywords like “why” “when” “who,”etc. and/or a question mark). The identified question is then dispatchedto the Q/A interface 508. Similarly, the user interface 502 maydetermine, based on semantic analysis and/or machine learningalgorithms, that an input is a task request and forward the input to thetask interface 510. For any input that cannot be classified or does notfall within the categories of question and task request, the userinterface 502 may forward it to the query interface 506 for generalquery search. It is understood that, in some embodiments, the userinterface 502 may not classify an input first, but instead, forward thesame input to each of the query interface 506, Q/A interface 508, andtask interface 510 to have their respective engines to process the inputin parallel.

Another function of the user interface 502 involves presentinginformation to the person 102 either as responses to the inputs, such assearch results, answers, and task status, or as spontaneous notices,reminders, and updates in response to any triggering events. In thisembodiment, the information to be presented to the person 102 via theuser interface 502 may be presented in the form of cards that aredynamically built on-the-fly by the dynamic card builder 528 based onthe intent estimated by the intent engine 524. The cards may be ofdifferent types, such as an email card summarizing one or more relatedemails, a search results card summarizing information relevant to one ormore search results, an answer card including an answer to a questionwith additional information associated with the answer, or a notice cardthat is automatically generated to notify the person 102 of any event ofinterest. Based on its type, a card may be dispatched to one of thequery interface 506, Q/A interface 508, and task interface 510 andeventually presented to the person 102 via the user interface 502. Inaddition to cards, information in any other format or presentationstyles, such as search results in a research results page with “bluelinks” or answers in plain text, may be provided by the search engine516 and the Q/A engine 518 directly to the query interface 506 and Q/Ainterface 508, respectively. It is understood that the user interface502 may also provide information in a hybrid matter, meaning that someinformation may be presented as cards, while other information may bepresented in its native format or style.

As the user interface 502 receives an input from the person 102, it alsotriggers the contextual information identifier 512 to collect anycontextual information related to the person 102 and the input of theperson 102. The contextual information identifier 512 in this embodimentreceives user-related information from the user database 514, such asthe person 102's demographic information and declared and inferredinterests and preferences. Another source of contextual information isthe person 102's device including, for example, date/time obtained fromthe timer of the person 102's device, location obtained from a globalpositioning system (GPS) of the person 102's device, and informationrelated to the person 102's device itself (e.g., the device type, brand,and specification). Further, the contextual information identifier 512may also receive contextual information from the user interface 502,such as one or more inputs immediately before the current input (i.e.,user-session information). Various components in the person-centricINDEX system 202, including the cross-linking engine 542, knowledgeengine 530, and intent engine 524, may take advantage of the contextualinformation identified by the contextual information identifier 512.

The intent engine 524 in this embodiment has two major functions:creating and updating the intent database 534 and estimating an intentbased on the information stored in the intent database 534. The intentdatabase 534 may store a personal intent space which includes all theintents that make sense to the person 102 in the form of an action plusa domain. For example, based on the person 102's search history, theintent engine 524 may identify that the person 102 has repeatedlyentered different queries all related to the same intent “makingrestaurant reservations.” This intent then may be stored as a data pointin the person's personal intent space in the intent database 534 in theform of {action=making reservations; domain=restaurant}. More and moredata points will be filled into the personal intent space as the person102 continues interacting with the person-centric INDEX system 202. Insome embodiments, the intent engine 524 may also update the personalintent space in the intent database 534 by adding new intents based onexisting intents. For instance, the intent engine 524 may determine thathotel is a domain that is close to the restaurant domain and thus, a newintent “making hotel reservations” (in the form of {action=makingreservations; domain=hotel}) likely makes sense to the person 102 aswell. The new intent “making hotel reservations,” which is notdetermined from user data directly, may be added to the personal intentspace in the intent database 534 by the intent engine 524. In someembodiments, the intent database 534 includes a common intent space forthe general population. Some intent that is not in the personal intentspace may exist in the common intent space. If they are popular amongthe general population or among people similar to the person 102, thenthe intent engine 524 may consider those intents as candidates as wellin intent estimation.

In estimating intent of the person 102, the intent engine 524 receivesthe input from the user interface 502 or any information retrieved bythe person-centric knowledge retriever 526 and tries to identify anyaction and/or domain from the input that is also in the intent spaces inthe intent database 534. If both action and domain can be identifiedfrom the input, then an intent can be derived directly from the intentspace. Otherwise, the intent engine 524 may need to take the contextualinformation from the contextual information identifier 512 to filterand/or rank the intent candidates identified from the intent space basedon the action or domain. In one example, if the input involves only theaction “making reservations” without specifying the domain, the intentengine 524 may first identify a list of possible domains that can becombined with such action according to the personal intent space, suchas “hotel” and “restaurant.” By further identifying that the locationwhere the input is made is at a hotel, the intent engine 524 mayestimate that the person 102 likely intends to make restaurantreservations as he is already in the hotel. It is understood that insome cases, neither action nor domain can be identified from the inputor the identified action or domain does not exist in the intent space,the intent engine 524 may estimate the intent purely based on theavailable contextual information. Various components in theperson-centric INDEX system 202, including the search engine 516, thesuggestion engine 504, the dynamic card builder 528, and theperson-centric knowledge retriever 526, may take advantage of the intentestimated by the intent engine 524.

The search engine 516 in this embodiment receives a search query fromthe query interface 506 and performs a general web search or a verticalsearch in the public space 108. Intent estimated by the intent engine524 for the search query may be provided to the search engine 516 forpurposes such as query disambiguation and search results filtering andranking. In some embodiments, some or all of the search results may bereturned to the query interface 506 in their native format (e.g.,hyperlinks) so that they can be presented to the person 102 on aconventional search results page. In this embodiment, some or all of thesearch results are fed into the dynamic card builder 528 for building adynamic search results card based on the estimated intent. For instance,if the intent of the query “make reservation” is estimated as “makingrestaurant reservations,” then the top search result of a localrestaurant may be provided to the dynamic card builder 528 for buildinga search results card with the name, directions, menu, phone number, andreviews of the restaurant.

The Q/A engine 518 in this embodiment receives a question from the Q/Ainterface 508 and classifies the question into either a personal ornon-personal question. The classification may be done based on a modelsuch as a machine learning algorithm. In this embodiment, the Q/A engine518 may check the person-centric knowledge database 532 and/or theprivate database 548 and semi-private database 546 in the person-centricspace 200 via the person-centric knowledge retriever 526 to see if thequestion is related to any private, semi-private data, or personalknowledge of the person 102. For instance, the question “who is TaylorSwift” is normally classified as a non-personal question. But in thecase if “Taylor Swift” is in the person 102's contact list or socialmedia friend list, or if “Taylor Swift” has sent emails to the person102, the Q/A engine 518 then may classify the question as a personalquestion. For non-personal questions, any known approaches may be usedto obtain the answers.

Once the question is classified as personal, various features includingentities and relationships are extracted by the Q/A engine 518 from thequestion using, for example, a machine learned sequence tagger. Theextracted entities and relationships are used to traverse, by theperson-centric knowledge retriever 526, the person-centric knowledgedatabase 532, which stores person-centric relationships stored in apre-defined form. In some embodiments, the person-centric relationshipsmay be stored in a triple format including one or more entities and arelationship therebetween. When the Q/A engine 518 finds an exact matchof relationship and entity, it returns an answer. When there is no exactmatch, the Q/A engine 518 takes into consideration a similarity betweenthe question and answer triples and uses the similarity to find thecandidate answers. To measure the similarity, words embedded over alarge corpus of user texts may be collected and trained by the Q/Aengine 518. The well-organized, person-centric information stored in theperson-centric space 200 and the person-centric knowledge database 532makes it possible for the Q/A engine 518 to answer a personal questionin a synthetic manner without the need of fully understanding thequestion itself. The answers generated by the Q/A engine 518 may beprovided to the dynamic card builder 528 for building answer cards.

The task generation engine 520 and the task completion engine 522 worktogether in this embodiment to achieve automatic task generation andcompletion functions of the person-centric INDEX system 202. The taskgeneration engine 520 may automatically generate a task in response to avariety of triggers, including for example, a task request from theperson 120 received via the task interface 510, an answer generated bythe Q/A engine 518, a card constructed by the dynamic card builder 528,or an event or behavior pattern related to the person 102 from theperson-centric space 200 and/or the person-centric knowledge database532. Intent may have also been taken into account in some embodiments intask generation. The task generation engine 520 in this embodiment alsodivides each task into a series of task actions, each of which can bescheduled for execution by the task completion engine 522. The tasktemplate database 538 stores templates of tasks in response to differenttriggers. The task generation engine 520 may also access the tasktemplate database 538 to retrieve relevant templates in task generationand update the templates as needed. In some embodiments, the taskgeneration engine 520 may call the dynamic card builder 528 to build acard related to one or more tasks so that the person 102 can check andmodify the automatically generated task as desired.

The tasks and task actions are stored into task lists 540 by the taskgeneration engine 520. Each task may be associated with parameters, suchas conditions in which the task is to be executed and completed. Eachindividual task action of a task may also be associated with executionand completion conditions. The task completion engine 522 fetches eachtask from the task lists 540 and executes it according to the parameterassociated therewith. For a task, the task completion engine 522dispatches each of its task actions to an appropriate executor toexecute it, either internally through the person-centric knowledgeretriever 526 or externally in the public space 108, semi-private space106, or private space 104. In one example, task actions such as “findingavailable time on Tuesday for lunch with mom” can be completed byretrieving calendar information from the private database 548 in theperson-centric space 200. In another example, task actions like“ordering flowers from Aunt Mary's flower shop” can only be completed byreaching out to the flower shop in the public space 108. The taskcompletion engine 522 may also schedule the execution of each taskaction by putting it into a queue. Once certain conditions associatedwith a task action are met, the assigned executor will start to executeit and report the status. The task completion engine 522 may update thetask lists 540 based on the status of each task or task action, forexample, by removing completed tasks from the task lists 540. The taskcompletion engine 522 may also provide the status updates to theperson-centric knowledge retriever 526 such that the status updates ofany ongoing task become available for any component in theperson-centric INDEX system 202 as needed. For instance, the dynamiccard builder 528 may build a notice card notifying the person that yourtask request “sending flowers to mom on Mother's day” has beencompleted.

As a component that supports intent-based dynamic card construction forvarious front-end components, the dynamic card builder 528 receivesrequests from the search engine 516, the Q/A engine 518, the taskgeneration engine 520, or the person-centric knowledge retriever 526. Inresponse, the dynamic card builder 528 asks for the estimated intentassociated with the request from the intent engine 524. Based on therequest and the estimated intent, the dynamic card builder 528 cancreate a card on-the-fly by selecting suitable card layout and/ormodules from the card module database 536. The selection of modules andlayouts is not predetermined, but may depend on the request, the intent,the context, and information from the person-centric space 200 and theperson-centric knowledge database 532. Even for the same queryrepeatedly received from the same person 102, completely different cardsmay be built by the dynamic card builder 528 based on the differentestimated intents in different contexts. A card may be created bypopulating information, such as search results, answers, status updates,or any person-centric information, into the dynamically selected andorganized modules. The filling of information into the modules on a cardmay be done in a centralized manner by the dynamic card builder 528regardless of the type of the card or may be done at each componentwhere the request is sent. For example, the Q/A engine 518 may receivean answer card construction with dynamically selected and organizedmodules on it and fill in direct and indirect answers into those modulesby itself.

In one embodiment, the person-centric knowledge retriever 526 can searchthe person-centric space 200 and the person-centric knowledge database532 for relevant information in response to a search request from theintent engine 524, the query interface, the Q/A engine 518, thesuggestion engine 504, the dynamic card builder 528, or the taskgeneration engine 520. The person-centric knowledge retriever 526 mayidentify one or more entities from the search request and search for thematched entities in the person-centric knowledge database 532. Asentities stored in the person-centric knowledge database 532 areconnected by relationships, additional entities and relationshipsassociated with the matched entities can be returned as part of theretrieved information as well. As for searching in the person-centricspace 200, in one embodiment, the person-centric knowledge retriever 526may first look for private data in the private database 548 matching theentities in the search request. As data in the person-centric space 200are cross-linked by cross-linking keys, the entities and/or thecross-linking keys associated with the relevant private data may be usedfor retrieving additional information from the semi-private database 546and the public database 544. For instance, to handle a search requestrelated to “amazon package,” the person-centric knowledge retriever 526may first look for information in the private database 548 that isrelevant to “amazon package.” If an order confirmation email is found inthe private database 548, the person-centric knowledge retriever 526 mayfurther identify that the order confirmation email is associated with across-linking key “tracking number” in the package shipping domain.Based on the tracking number, the person-centric knowledge retriever 526then can search for any information that is also associated with thesame tracking number in the person-centric space 200, such as thepackage status information from www.FedEx.com in the public database544. As a result, the person-centric knowledge retriever 526 may returnboth the order confirmation email and the package status information asa response to the search request.

In some embodiments, the person-centric knowledge retriever 526 mayretrieve relevant information from multiple data sources in parallel andthen blend and rank all the retrieved information as a response to thesearch request. It is understood that information retrieved from eachsource may be associated with features that are unique for the specificsource, such as the feature “the number of recipients that are cc'd” inthe email source. In order to be able to blend and rank results fromdifferent sources, the person-centric knowledge retriever 526 maynormalize the features of each result and map them into the same scalefor comparison.

The cross-linking engine 542 in this embodiment associates informationrelevant to the person 102 from the private space 104, the semi-privatespace 106, and the public space 108 by cross-linking data based oncross-linking keys. The cross-linking engine 542 may first process allinformation in the private space 104 and identify cross-linking keysfrom the private space 104. For each piece of content in the privatespace 104, the cross-linking engine 542 may identify entities anddetermine the domain to which the content belongs. Based on the domain,one or more entities may be selected as cross-linking keys for thispiece of content. In one example, tracking number may be a cross-linkingkey in the package shipping domain. In another example, flight number,departure city, and departure date may be cross-linking keys in theflight domain. Once one or more cross-linking keys are identified foreach piece of information in the private space 104, the cross-linkingengine 542 then goes to the semi-private space 106 and the public space108 to fetch information related to the cross-linking keys. For example,the tracking number may be used to retrieve package status informationfrom www.FedEx.com in the public space 108, and the flight number,departure city, and departure date may be used to retrieve flight statusfrom www.UA.com in the public space 108. Information retrieved by thecross-linking engine 542 from the private space 104, semi-private space106, and public space 108 may be stored in the private database 548,semi-private database 546, and public database 544 in the person-centricspace 200, respectively. As each piece of information in theperson-centric space 200 is associated with one or more cross-linkingkeys, they are cross-linked with other information associated with thesame cross-linking keys, regardless which space it comes from. Moreover,as the cross-linking keys are identified based on the person's privatedata (e.g., emails), all the cross-linked information in theperson-centric space 200 are relevant to the person 102.

Although only one database is shown in FIG. 5 for information from theprivate space 104, the semi-private space 106, or the public space 108,it is understood that information within a particular space may beorganized and stored in different databases in the person-centric space200. For instance, private data from emails, contacts, calendars, andphotos may be stored in separate databases within the private database548; semi-private data from Facebook, Twitter, LinkedIn, etc. may bestored in separate databases within the semi-private database 546 aswell. Such arrangement may enable applying different feature extractionmodels to different data sources, which may be helpful for thesuggestion engine 504 and the person-centric knowledge retriever 526. Asmentioned above, the cross-linking engine 542 continuously anddynamically maintains and updates the person-centric space 200 on aregular basis and/or in response to any triggering event. For example,any internal operation, such as query search, question answering, ortask completion, may trigger the cross-linking engine 542 to update theaffected data or add missing data in the person-centric space 200.

The knowledge engine 530 in this embodiment processes and analyzes theinformation in the person-centric space 200 to derive analytic resultsin order to better understand the person-centric space 200. In oneembodiment, the knowledge engine 530 extracts entities from content inthe person-centric space 200 and resolves them to what they refer to(i.e., can disambiguate an extracted entity when it may refer tomultiple individuals). In addition to determining an entity type for anextracted entity name, the knowledge engine 530 may also determine aspecific individual referred to by this entity name. The knowledgeengine 530 can make use of contextual information and/or textualmetadata associated with the entity name in the email to disambiguatesuch cases, providing a high precision resolution.

The knowledge engine 530 also builds a person-centric knowledgerepresentation for a person 102 by extracting and associating data aboutthe person 102 from personal data sources. The person-centric knowledgerepresentation for the person 102 is stored in the person-centricknowledge database 532. The knowledge engine 530 can extract entitiesrelated to the person 102 and infer relationships between the entitieswithout the person 102's explicit declaration, and create, for example,a person-centric knowledge graph for the person 102 based on theentities and relationships. The knowledge elements that can be inferredor deduced may include, for example, the person 102's social contacts,and the person 102's relationships with places, events, or other users.

FIG. 6 is a flowchart of an exemplary process for building aperson-centric space, according to an embodiment of the presentteaching. Starting at 602, data from the private space 104 is obtained.The data includes any content that is private to a person, such asemails, contacts, calendar events, photos, bookmarks, instant messages,usage records, and so on. Contextual information is obtained at 604. Thecontextual information includes, but is not limited to, user informationsuch as demographic information and interests and preferences, localeinformation, temporal information, device information, and user-sessioninformation (e.g., other user inputs in the same or adjacentuser-sessions). At 606, information from the private space data isextracted. The information may be cross-linking keys determined fromentities extracted from the private space data based on the domain ofthe private space data and/or the obtained contextual information.Person-centric data is then retrieved from the semi-private space at608. Similarly, person-centric data is retrieved from the public spaceat 610. In this embodiment, the person-centric data is retrieved basedon the cross-linking keys. At 612, all pieces of person-centric dataretrieved from the private space, semi-private space, and public spaceare cross-linked together to generate a person-centric space. In thisembodiment, the cross-linking is done based on the same cross-linkingkeys associated with these pieces of person-centric data. At 614,analytic data is derived from the person-centric space. For example,entities may be extracted from the person-centric space and aredisambiguated by the knowledge engine 530 to ascertain their extractmeanings. Relationships between the entities may be inferred based oninformation from the person-centric space by the knowledge engine 530 aswell. Based on the entities and relationships, person-centric knowledgecan be derived and stored in the person-centric knowledge database 532.

FIG. 7 is a flowchart of an exemplary process for applying aperson-centric space for digital personal assistance, according to anembodiment of the present teaching. Starting at 702, an input from aperson is received. As the person enters the input, a preliminary intentis estimated and continuously updated at 704. The estimation may bebased on the current input and any contextual information currentlyavailable. At 706, one or more suggestions are generated based on thecurrent input and the estimated intent and provided to the person toassist completing the current input. A response to the suggestions isreceived from the person at 708. The response may be a selection of onesuggestion or ignoring the suggestions and finishing the input as theperson desires. Once the completed input is received, either as aselection of a suggestion or a fully-entered input, at 710, the intentis estimated again for the completed input. The intent may be estimatedbased on the completed input and the currently available contextualinformation. In some embodiments, if no input is received (e.g., whenthe person just logs into and has not entered anything yet), the intentmay be estimated based on the contextual information alone. At 712,person-centric knowledge is retrieved based on the input. In someembodiments, the estimated intent may be used for retrieving theperson-centric knowledge as well. As described above in detail, theinput may be a question, a task request, or a query. In any event,entities and/or relationships may be derived from the input and are usedfor retrieving relevant person-centric knowledge from the person-centricknowledge database 532. In some embodiments, additional information maybe retrieved from the person-centric space. Intent-based cards are builtat 714. Each card may be constructed based on a layout and one or moremodules that are selected based on the type of the card and theestimated intent. Content in each module may be filled in based on theperson-centric knowledge and any additional information retrieved at712. Optionally or additionally, at 718, the construction of a card maycause a task to be generated based on the estimated intent. Forinstance, an email card summarizing an online order confirmation emailmay trigger the generation of a task for automatically tracking thepackage status. At 720, the task is executed. Nevertheless, at 716, theintent-based cards, either an email card, an answer card, a searchresults card, or a notice card, are provided to the person as a responseto the input.

FIG. 8 depicts an exemplary scheme of building a person-centric spacefor each individual person via the person-centric INDEX system andapplications thereof, according to an embodiment of the presentteaching. In this embodiment, each person 102-1, . . . 102-n may accessits own person-centric INDEX system 202-1, . . . 202-n, respectively.The person-centric INDEX system 202 may be a stand-alone systeminstalled on each person 102-1, . . . 102-n's device, a cloud-basedsystem shared by different persons 102-1, . . . 102-n, or a hybridsystem in which some components are installed on each person 102-1, . .. 102-n's device and some components are in the cloud and shared bydifferent persons 102-1, . . . 102-n.

In this embodiment, individual person-centric spaces 200-1, . . . 200-nare generated for each person 102-1, . . . 102-n via its ownperson-centric INDEX system 202-1, . . . 202-n, respectively Forexample, person-centric space 1 200-1 includes the projections fromdifferent spaces related to person 1 102-1 from the perspectives ofperson 1 102-1 (e.g., the entire private space 1 104-1, parts of thesemi-private spaces 1-k 106-1, . . . 106-k that are relevant to person 1102-1, and a slice of the public space 108 that is relevant to person 1102-1). Each person 102-1, . . . 102-n then uses its own person-centricINDEX system 202-1, . . . 202-n to access its own person-centric space200-1, . . . 200-n, respectively. Based on inputs from a person to itsperson-centric INDEX system, outputs are returned based on informationfrom the person-centric space in any forms and styles, including, forexample, any conventional outputs such as search result pages with “bluelinks,” and any types of intent-based cards such as search resultscards, answer cards, email cars, notice cards, and so on.

FIG. 9 depicts an exemplary scheme in which a variety of dynamic cardsare built and provided to a person based on different intents estimatedfor the same query in different contexts, according to an embodiment ofthe present teaching. Conventionally, a static card that has beenpre-constructed for certain popular entities may be presented to aperson when the query from the person happens to include one of thosepopular entities. In contrast, intent-based cards according to thepresent teaching can be dynamically generated on-the-fly by theperson-centric INDEX system 202 responsive to a query 902 from theperson 102. In this example, the person 102 inputs the same query 902“super bowl” at different times. When the query 902 is entered threeweeks before the super bowl game, its temporal context 904 will likelycause the intent 906 to be estimated as “buying super bowl tickets.”Based on such intent, a card 908 is dynamically generated for buyingsuper bowl tickets, including information such as super bowl ticketprices, tips on how to purchase, purchase website, etc. In someembodiments, the generation of this card 908 would cause a task ofpurchasing super bowl tickets to be automatically generated andcompleted. As time passes, when the temporal context 910 changes to thesuper bowl night, when the person 102 enters the same query 902, theintent 912 will likely change to “watching super bowl game.”Accordingly, a different card 914 for online super bowl game streamingis built and presented to the person 102, which may include, forexample, websites currently streaming the game. When the game finishesand the temporal context 916 changes to the day after the super bowlgame, if the person 102 again enters the query 902, the intent 918 willlikely become “reading super bowl game reviews.” A card 920 of superbowl game reviews is constructed and presented to the person 102. It isunderstood that the examples described above are for illustrativepurpose and are not intended to be limiting.

FIG. 10 illustrates an exemplary answer card, according to an embodimentof the present teaching. The answer card 1000 in this example isdynamically constructed on-the-fly in response to the question “when ismy son's soccer game?” Based on the type of the card (answer card) andintent (finding out my son's soccer game date/time), the layout andmodules are determined as shown in FIG. 10. It is understood that theshape, size, and layout of the answer card 1000 is for illustrativepurpose only and may vary in other examples. In some embodiments, theshape, size, and layout may be dynamically adjusted to fit thespecification of the user device (e.g., screen size, display resolution,etc.).

In this example, the answer card includes an answer header module 1002indicating that the topic of the answer card 1000 is “Daniel's (my son'sname identified according to person-centric knowledge) Next SoccerGame.” The direct answer to the question is found from a private emailand provided in the date/time module 1004. Optionally, certain actionsrelated to the answer may be provided as well, such as “add to mycalendar” and “open related emails.” Other information related to thedirect answer is provided in other modules as well. The location module1006 provides the location, address, and map of the soccer game.Information such as location and address may be retrieved from the emailrelated to the game in the private database 548 of the person-centricspace 200, while the map may be retrieved from Google Maps in the publicspace 108. The weather module 1008 provides the weather forecast of thegame day, which may be retrieved from wwww.Weather.com in the publicspace 108. The contact module 1010 shows persons involved in the gameand their contact information retrieved from the email about the gameand private contact lists in the private database 548 of theperson-centric space 200. Optionally, action buttons may be provided tocall the persons directly from the answer card 1000. It is understoodthat the example described above is for illustrative purpose and are notintended to be limiting.

FIG. 11 illustrates an exemplary search result card, according to anembodiment of the present teaching. The search results card 1100 in thisexample is dynamically constructed on-the-fly in response to the query“amy adams.” Based on the type of the card (a search results card) andintent (learning more about actor Amy Adams), the layout and modules aredetermined as shown in FIG. 11. It is understood that the shape, size,and layout of the search results card 1100 is for illustrative purposeonly and may vary in other examples. In some embodiments, the shape,size, and layout may be dynamically adjusted to fit the specification ofthe user device (e.g., screen size, display resolution, etc.). In thisexample, the search results card 1100 includes a header module 1102 withthe name, occupation, and portrait of Amy Adams. The bio module 1104includes her bio retrieved from Wikipedia, and the movies module 1106includes her recent movies. In the movies module 1106, each movie may bepresented in a “mini card” with the movie's name, release year, poster,and brief instruction, which are retrieved from www.IMDB.com. The moviesmodule 1106 is actionable so that a person can swap the “mini cards” tosee information of more her movies. If more modules cannot be shownsimultaneously due to the size of the search results card 1100 (forexample when it is shown on a smart phone screen), tabs (e.g., “Latest,”“About”) may be used to display different modules. It is understood thatthe example described above is for illustrative purpose and are notintended to be limiting.

FIG. 12 depicts an exemplary scheme of automatic online order emailsummary and package tracking via cross-linked data in a person-centricspace, according to an embodiment of the present teaching. Variousaspects of the present teaching are illustrated in FIG. 12 as well asrelated FIGS. 13-15, including cross-linking data from different spaces,entity extraction and building person-centric knowledge representation,dynamic card productions based on intent, answering personal questions,and automatic task generation and completion. In this example, at timet0, an order confirmation email 1202 is received from www.Amazon.com.The email 1202 in the private space is processed to extract and identifyentities. The entities include, for example,seller/vendor—www.Amazon.com, recipient/person—Mike, order date—Dec. 25,2015, item—Contract Case book, shipping carrier—FedEx, trackingnumber—12345678, and estimated delivery date: Jan. 1, 2016. In responseto receiving the email 1202, an email card 1204 summarizing the email1202 is generated and may be provided to Mike automatically or upon hisrequest.

The generation of the email card 1204 in this example automaticallyinitiates the generation of task 1 1206 for checking package status. Thedetails of task 1 1206 will be described in FIG. 13. In order to checkthe package status, one or more cross-linking keys in the packageshipping domain are identified among the entities extracted from theemail 1202. As shown in FIG. 13, the entity “shipping carrier—FedEx” isa cross-linking key used for identifying the website of FedEx 1208 inthe public space, and the entity “tracking number—12345678” is across-linking key used for calling the status check API 1210 of FedEx1208. Based on the tracking number, package status information 1212 isretrieved from FedEx 1208. Different pieces of information from theprivate space and public space are thus cross-linked based on thecross-linking keys and can be projected into the person-centric space.

At time t1, in response to an input from Mike (e.g., a question “whereis my amazon order?”), an answer card 1214 is dynamically generatedbased on private information in the email card 1204 and the publicpackage status information 1212. The answer card 1214 is presented toMike as an answer to his question. In this example, the generation ofthe answer card 1214 automatically initiates another task 2 1216 formonitoring and reporting package status update. According to task 21216, package status information 1212 may be regularly refreshed andupdated according to a schedule (e.g., every two hours) or may bedynamically refreshed and updated upon detecting any event that affectsthe package delivery. In this example, at times t2 and tn, certainevents, such as package being delayed due to severe weather or packagebeing delivered, trigger the generation of notice cards 1218, 1220,respectively. It is understood that the example described above is forillustrative purpose and are not intended to be limiting.

FIG. 13 illustrates an exemplary task with a list of task actions forautomatic package tracking. Task 1 1206 for tracking package status inthis example includes a series of task actions (task action list):identifying shipping carrier 1302, identifying tracking number 1304,obtaining shipping carrier's URL 1306, calling shopping carrier's statuscheck API using the tracking number 1308, extracting status information1310, and filling in the card 1312. Each task action may be associatedwith parameters such as conditions in which the task action is to beexecuted. For example, for task action 1312 “filling in the card,” thecondition may be filling the current package status into an answer cardwhen a question about the package status is asked by the person orfilling the current package status into a notice card of package statusupdate without waiting for any input from the person. Some task actions(e.g., 1302, 1304) may be executed by retrieving relevant informationfrom the person-centric space 200 and/or the person-centric knowledgedatabase 532, while some task actions (e.g., 1308) need to be completedin the public space 108. It is understood that the example describedabove is for illustrative purpose and are not intended to be limiting.

FIG. 14 illustrates a series of exemplary cards provided to a person inthe process of automatic online order email summary and packagetracking. In this example, the email card 1204 is automaticallygenerated responsive to receiving the amazon order confirmation email1202 and summarizes the email 1202 based on the entities extracted fromthe email 1202 and relationships thereof. The email card 1204 includes aheader module “My Amazon Oder” and an order module with entities of itemand price. A “buy it again” action button may be added in the ordermodule. The email card 1204 also includes a shipping module withentities of shipping carrier, tracking number, and scheduled deliverydate.

In this example, the answer card 1214 is generated in response to aquestion from the person about the status of the package. The answercard 1214 includes the header module and order module (but with lessinformation as the order information is not a direct answer to thequestion). The answer card 1214 includes a shipping module with richinformation related to shipping, which is retrieved from both theprivate email 1202 and FedEx 1208. The information includes, forexample, entities of shipping carrier, tracking number, and scheduleddelivery date from the private email 1202, and current estimateddelivery date, status, and location from FedEx 1208.

In this example, multiple notice cards 1218, 1220 are automaticallygenerated in response to any event that affects the status of thepackage. Each notice card 1218, 1220 includes an additional notificationmodule. If any other information is affected or updated due to theevent, it may be highlighted as well to bring to the person's attention.In notice card 1 1218, shipment is delayed due to a winter storm in ABCtown and as a consequence, the current estimated delivery date ischanged according to information retrieved from FedEx 1208. According tonotice card N 1220, the package has been delivered to Mike's home. It isunderstood that the examples described above are for illustrativepurpose and are not intended to be limiting.

FIG. 15 illustrates exemplary entities extracted from a person-centricspace and their relationships established in the process of automaticonline order email summary and package tracking. As described above, theperson-centric knowledge database 532 stores person-centric knowledgeorganized in the form of entity-relationship-entity triples. Entitiesextracted from the amazon order confirmation email 1202 are formed intoentity-relationship-entity triples by the knowledge engine 530. In theexample of FIG. 15, entity “Mike” 1502 from the recipient field of theemail 1202 is determined as the person using the person-centric INDEXsystem 202, and entity “FedEx” 1504 is determined as a shipping carrierwith a short-term relationship 1506 with entity “Mike” 1502. Attributes1508 may be associated with the relationship 1506 including, forexample, temporal attribute, tracking number, shipping item, sender,etc. These attributes may include related entities extracted from theemail 1202 and any other attributes inferred based on the relationship1506. It is noted that the relationship 1506 between entity “Mike” 1502and entity “FedEx” 1504 is a short-term, temporary relationship in thesense that the relationship 1506 will become invalid after the shipmentis completed, as indicated by the temporal attribute. In this example,entity “Mike” 1502 and another entity “Amazon” 1510 establish along-term relationship 1512 with a different set of attributes 1514thereof. The attributes 1514 include, for example, the temporalattribute, item, item rating, and so on. The relationship 1512 islong-term in this example because Mike has been repeatedly ordered goodsfrom Amazon, which has become his behavior pattern or preference. It isunderstood that the examples described above are for illustrativepurpose and are not intended to be limiting.

More detailed disclosures of various aspects of the person-centric INDEXsystem 202 are covered in different U.S. patent applications, entitled“Method and system for associating data from different sources togenerate a person-centric space,” “Method and system for searching in aperson-centric space,” “Methods, systems and techniques for providingsearch query suggestions based on non-personal data and user personaldata according to availability of user personal data,” “Methods, systemsand techniques for personalized search query suggestions,” “Methods,systems and techniques for ranking personalized and generic search querysuggestions,” “Method and system for entity extraction anddisambiguation,” “Method and system for generating a knowledgerepresentation,” “Method and system for generating a card based onintent,” “Method and system for dynamically generating a card,” “Methodand system for updating an intent space and estimating intent based onan intent space,” “Method and system for classifying a question,”“Method and system for providing synthetic answers to a personalquestion,” “Method and system for automatically generating andcompleting a task,” “Method and system for online task exchange,”“Methods, systems and techniques for blending online content frommultiple disparate content sources including a personal content sourceor a semi-personal content source,” and “Methods, systems and techniquesfor ranking blended content retrieved from multiple disparate contentsources.” The present teaching is particularly directed to classifying aquestion using person-centric knowledge associated with the person whoasked the question and providing synthetic answers to a personalquestion.

Internet technology facilitates information publishing, informationsharing, and data exchange in various spaces and among various types ofusers. Some information is publicly available to all Internet users. Forexample, services provided through FedEx.com and fashion clothes carriedby Nordstrom.com are publicly accessible to all Internet users. Yet someinformation is completely private and can only be accessible to a user.For example, personal emails in Yahoo! mail, Gmail, Outlook, etc. andpersonal calendar events are only accessible to the user when he or shelogs in using his or her private credentials. Although FedEx.com andNordstrom.com provide public information, these commercial websites mayalso host private information associated with individual users at thesame time (e.g., a personal account with FedEx.com or a personal accountwith Nordstrom.com). In some circumstances, information is semi-privateand is shared with a certain group of people. For example, Facebookuser, Mark, may choose to set his information as public so everyone cansearch and view; while another Facebook user, Jennie, may choose toshare some of her personal information with her selected friends.Information shared by Jennie among her selected friends is semi-private.

When a user asks a question via an Internet browser from a personaldevice, answers are searched from all spaces connected to Internetincluding a public space, a semi-private space, a private space, etc. Ingeneral, the question is initially analyzed and understood beforeanswers are searched. When the question is more like a personalquestion, for example, “when is my son's football game?” searchingvastly in Internet is unnecessary and inefficient. Further, answers fromdifferent spaces are presented to the user in an order of relevance,which does not take into account the association between the questionand the user. Therefore, locating the desired answer from the presentedmultiple answers may be difficult.

There is a need for improvements over the conventional approaches toprovide synthetic answers to a personal question.

The present teaching discloses a method and a system for answeringpersonal questions. To achieve this, a question is first classified intoa personal question or a non-personal question based on a model. Themodel may be one or more computer algorithms such as a machine learningalgorithm. Once the question is classified as personal, various featuresincluding entities and relations are extracted from the question. Theextracted entities and relations are used to traverse a person-centricknowledge database, which stores person-centric knowledge extracted fromprivate spaces and semi-private spaces, such as emails, blog posts, andcalendar events, in a pre-defined form. In some embodiments, theperson-centric information may be stored in a triple format includingone or more entities and a relation between the one or more entities.When an exact match of relation and entity are found, an answer isreturned. When there is no exact match, a similarity between thequestion and answer triples is taken into consideration and used to findthe candidate answers. To measure the similarity, words embedded over alarge corpus of user texts may be collected and trained. The presentteaching provides an efficient solution to search for answers topersonal questions and increases user engagement and contentunderstanding.

Additional novel features will be set forth, in part, in the descriptionwhich follows and, in part, will become apparent to those skilled in theart upon examination of the following and the accompanying drawings.Also, it may be learned by production or operation of the examples. Thenovel features of the present teachings may be realized and attained bypractice or use of various aspects of the methodologies,instrumentalities and combinations set forth in the detailed examplesdiscussed below.

FIG. 16 depicts an exemplary system diagram for question classificationand providing synthetic answers to a personal question, according to anembodiment of the present teaching. The system comprises one or moreelements of the person-centric INDEX system 202 illustrated in FIG. 5including the user interface 502, the Q/A interface 508, the Q/A engine518, the dynamic card builder 528, the person-centric knowledgeretriever 526, the knowledge engine 530, and the cross-linking engine542. The basic functions of the noted elements are described above, andtherefore, are not detailed herein. For the aspect of questionclassification and providing synthetic answers, user interface 502 isconfigured to receive a question inputted by a person 102; Q/A interface508 is configured to parse the question and classify the question into apersonal question or a non-personal question based on establishedperson-centric knowledge; Q/A engine 518 is configured to generate oneor more synthetic answers to the classified question and present the oneor more synthetic answers in a format, such as an f answer card, to theperson 102; person-centric knowledge retriever 526 is configured toretrieve person-centric knowledge associated with the person 102 from aperson-centric knowledge database 532 upon determining that the questionis classified as a personal question and provide the person-centricknowledge associated with the person 102 to Q/A engine 518; dynamic cardbuilder 528 is configured to dynamically build one or more answer cardsbased on the question classification and provide the one or more answercards to Q/A engine 518; knowledge engine 530 is configured to analyzeinformation collected from a public space 108, a semi-private space 106,and a private space 104, create person-centric knowledge about theperson 120 based on the analysis, and store and/or update theperson-centric knowledge in person-centric knowledge database 532; andcross-linking engine 542 is configured to collect information associatedwith the person 102 from public space 108, semi-private space 106, andprivate space 104.

The system for question classification and providing synthetic answersmay further comprises a task-driven answer search engine 1606 configuredto search answers to a personal question from all available spaces whenQ/A engine 518 determines that the answers obtained from theperson-centric knowledge associated with the person 102 are no longervalid. Task-driven answer search engine 1606 generates a list of tasksto search answers to the personal question from semi-private space 106and private space 104. In some embodiments, task-driven answer searchengine 1606 may search answers to the personal question from allavailable resources in addition to semi-private space 106 and privatespace 104. Task-driven answer search engine 1606 may use an availabletemplate from a task template database 538 or create a new template ifit is a new task. Task-driven answer search engine 1606 may correspondto one or more elements of the person-centric INDEX system 202illustrated in FIG. 5, for example, a task generation engine 520 and atask completion engine 522. The system components described above arefor illustrative purpose; however, the present teaching is not intendedto be limiting and may comprises and/or cooperate with other elements toprovide synthetic answers to a personal question. It is understood thatalthough the present teaching related to question classification will bedescribed below in detail as part of the person-centric INDEX system202, in some embodiments, the system and method disclosed in the presentteaching can be independent from the person-centric INDEX system 202 oras part of another system.

FIG. 17 depicts a detailed system diagram for question classificationand providing synthetic answers to a personal question, according toanother embodiment of the present teaching. The system comprises a userinterface 502, a question parser 1702, a question classifier 1704, aquestion classification archive 1706, a synthetic answer engine 1708, aperson-centric answer search engine 1710, an answer card generator 1712,a public answer search engine 1714, a dynamic card builder 528, and aperson-centric knowledge retriever 526. The question parser 1702 isconfigured to analyze the textual information of the question intological syntactic components and identity one or more features from thequestion. Question parser 1702 may implement one or more computationalmethods to analyze the textual information of the question, for example,a machine learned sequence tagger to identify and categorize entitiespresented in the question. Question parser 1702 may further implement anatural language parser to extract one or more relations between theidentified entities. Question classifier 1704 is configured to classifythe question into a personal question or a non-personal question using aselected model based on the question. Question classification archive1706 is configured to temporarily archive the classification results forthe purpose of training the person-centric knowledge. Person-centricanswer search engine 1710 is configured to search one or moreperson-centric answers with respect to the person 102 based oninformation retrieved from a person-centric knowledge database 532 byperson-centric knowledge retriever 526. Public answer search engine 1714is configured to search for answers from all available sources, forexample, from a public space 108, a semi-private space 106, and/or aprivate space 104. The searched person-centric answers and publicanswers are provided to synthetic answer engine 1708. Synthetic answerengine 1708 is configured to synthesize and prioritize the receivedperson-centric answers and the public answers and provide the syntheticanswers to answer card generator 1712. Answer card generator 1712obtains one or more answer card templates from dynamic card builder 528and generates one or more integrated answer cards embedded with thesynthetic answers, respectively. The one or more integrated answer cardsare further presented to the person via user interface 502 as a responseto the question. In some embodiments, all synthetic answers use a sameanswer card template; yet in some other embodiments, more than oneanswer card template are used to present the synthetic answers. In someembodiments, the synthetic answers may be presented in a combination ofdifferent formats including answer cards, list of answers, graphicrepresentations of answers, etc.

In some embodiments, the system for question classification andproviding synthetic answers to a personal question may further comprisesa task generation engine 520 and a task completion engine 522. When itis determined that the received person-centric answer is no longer validand/or out-of-date, synthetic answer engine 1708 instructs taskgeneration engine 520 to generate a list of tasks to search for a validversion and/or an up-to-date version related to the person-centricanswer from all available resources. Task completion engine 522 isconfigured to perform the list of tasks and provide a valid versionand/or an up-to-date version related to the person-centric answer tosynthetic answer engine 1708. In some embodiments, task generationengine 520 may search a task template database 538 and determine whethera template with respect to the generated list of tasks is available. Ifan available template is found in template database 538, task generationengine 520 integrates the list of tasks with the available template andforwards the integrated list of tasks to task completion engine 522. Ifan available template is not found in template database 538, taskgeneration engine 520 creates a task template with respect to theperson-centric answer, integrates the list of tasks with the newlycreated template, and forwards the integrated list of tasks to taskcompletion engine 522.

In some embodiments, the system for question classification andproviding synthetic answers to a personal question may further comprisesa Q/A learning engine 1726 and an updating module 1728. Q/A learningengine 1726 receives inputs from question classification archive 1706,synthetic answer engine 1708, dynamic card builder 528, and/or othercomponents of the person-centric INDEX system 202 directly orindirectly. Q/A learning engine 1726 is configured to analyzeinformation related to the question classification, the syntheticanswers to a personal question, person's interactions with the providedanswer cards, etc. based on the received inputs, and determine whetherthe person-centric knowledge associated with the person 102 needs to beupdated in person-centric knowledge database 532, and/or whether thetask templates need to be updated in task template database 538. Once itis determined that the person-centric knowledge and/or the tasktemplates need to be updated, updating module 1728 updatesperson-centric knowledge database 532 and/or task template database 538accordingly.

FIG. 18 is a flowchart of an exemplary process for questionclassification and providing synthetic answers to a personal question,according to an embodiment of the present teaching. A question from aperson is received at 1802. A determination as to whether the questionis personal or non-personal is made at 1804. If it is determined thatthe question is personal, answers to the personal question is searchedfrom a person-centric knowledge database at 1806. If it is determinedthat the question is non-personal, answers to the non-personal questionis searched from a public space at 1814. A determination as to whetherthe searched answer from the person-centric knowledge database 532 isvalid is made at 1808. If it is determined that the searched answer fromthe person-centric knowledge database 532 is valid, answers are obtainedfrom the person-centric knowledge database 532 at 1816. If it isdetermined that the searched answer from the person-centric knowledgedatabase 532 is invalid, a task list to search for answers is generatedat 1810. The task list is performed and answers are searched from allavailable spaces at 1812. One or more answer cards with integratedsynthetic answers are generated at 1820. The person-centric knowledgedatabase 532 is updated at 1822. The task template database 538 isupdated at 1824. The card template database is updated at 1826.

FIG. 19 depicts an exemplary system diagram of a question classifier,according to an embodiment of the present teaching. Question classifier1704 as illustrated in FIG. 17 comprises a feature extractor 1902, aquestion pattern extractor 1904, a person-centric feature archive 1906,a personal question pattern archive 1908, a classification knowledgeretriever 1910, a feature comparing module 1912, a pattern comparingmodule 1914, a classification decision module 1916, and a questionclassification archive 1706. Feature extractor 1902 is configured toextract one or more features from the received question. The one or morefeatures may be categorized into personal entities, non-personalentities, dates and/or time, locations, verbs and/or action words,relations between personal entities and/or non-personal entities, etc.Personal entities may be a person's name, a person's nickname, aperson's user ID in the semi-private space such as Twitter, Facebook,etc. Non-personal entities may be a name, a nickname, and/or anabbreviation of a business entity, a building and/or a construction, abotanical entity, an animal, and/or any non-human being entities.Feature extractor 1902 may be further configured to determine therelations between any personal entities and/or non-personal entitiesbased on the categorized features. The extracted features and theassociated relations are further used to construct a plurality ofarguments that define person-centric knowledge with respect to aparticular person.

Question pattern extractor 1904 is configured to extract one or morepatterns related to the question. For example, the received question is“How to write a good resume?” and the pattern of the question may beextracted as a pattern of using interrogative word “How.” Based on thequestions being asked over the Internet, most of the questions with thepattern of using interrogative word “How” receive answers from allavailable resources. Accordingly, a question with the pattern of usinginterrogative word “How” may be classified as a non-personal question.In some embodiments, question pattern extractor 1904 may determinewhether the question contains some type of entities, for example, anentity associated with the person who asked the question, or an entityrelated to a celebrity, etc. In some other embodiments, question patternextractor 1904 may combine the knowledge of the interrogative word andthe entities to generate patterns of questions.

Classification knowledge retriever 1910 is configured to retrieveperson-centric knowledge related to question classification from aperson-centric knowledge database 532. Classification knowledgeretriever 1910 may function as a part of person-centric knowledgeretriever 526. In some embodiments, classification knowledge retriever1910 may function independently from person-centric knowledge retriever526 but exchange data with person-centric knowledge retriever 526. Theretrieved classification knowledge is saved in one or more knowledgearchives based on the categories of the classification knowledge. Insome embodiments, the retrieved classification knowledge is saved inperson-centric feature archive 1906 and personal question patternarchive 1908, respectively. Person-centric feature archive 1906 andpersonal question pattern archive 1908 may be stored in a cached memoryof the system. Information saved in person-centric feature archive 1906and personal question pattern archive 1908 may be updated dynamically inaccordance with the question and/or the user who asked the question.Feature comparing module 1912 compares the retrieved features with thearchived person-centric feature and pattern comparing module 1914compares the retrieved question pattern with the archived personalquestion patterns. The comparison results are sent to classificationdecision module 1916 to determine whether the question is in a personalquestion category or a non-personal question category. Classificationdecision module 1916 may use different decision schemes based on thereceived comparison results. For example, classification decision module1916 may determine the question classification based on the featurecomparison result or the question pattern comparison result. In someembodiments, classification decision module 1916 may combine bothresults from the feature comparison and the question pattern comparisonto determine the question classification. It should be appreciated thatthe above description related to question classification is forillustrative purpose and is not intended to be limiting. The presentteaching may classify a question into a personal category or anon-personal category based on other knowledge associated with thequestion and/or the user asked the question. In some embodiments, thepresent teaching may further classify the personal category or thenon-personal category into one or more sub-categories in order toprovide more satisfying answers to the person.

In some embodiments, question classifier 1704, as illustrated in FIG.17, may further comprise a classification knowledge learning engine1920. Classification knowledge learning engine 1920 is configured toanalyze the question being asked, the classification of the question,answers to the question, and/or person's interaction with the answers,etc. to determine whether the question classification is appropriate ornot. Information related to the question classification is saved inquestion classification archive 1706, which includes a complete questionbeing asked by a person, parsed question components, extracted features,extracted question pattern, classification decision, i.e., personal ornon-personal, etc. If the question is classified as personal,person-centric answer search engine 1710 searches for one or moreperson-centric answers based on the person-centric knowledge. If thequestion is classified as non-personal, public answer search engine 1714searches for one or more public answers from all available resources.The plurality of person-centric answers and public answers are presentedto the person. In some embodiments, the plurality of person-centricanswers and public answers are presented together with one or moresponsored search results. Upon receiving the answers via user interface502, the person selects and reviews one or more of the presented answersand/or sponsored search results. Such person interactions are collectedand sent to classification knowledge learning engine 1920.Classification knowledge learning engine 1920 analyzes informationrelated to the question classification, the plurality of person-centricanswers and public answers, and information related to personinteractions with the presented answers. Classification knowledgelearning engine 1920 determines whether the person is satisfied with thepresented answers and whether the question classification is appropriateor not. The classification knowledge is further updated inperson-centric knowledge database 532 directly or indirectly.Classification knowledge learning engine 1920 as illustrated functionsas a part of question classifier 1704, and therefore, the classificationknowledge data is communicated to knowledge engine 530 to updateperson-centric knowledge database 532. However, the present teaching isnot intended to be limiting. In some embodiments, classificationknowledge learning engine 1920 may be integrated to knowledge engine 530as one function module, and the classification knowledge data is updateddirectly to person-centric knowledge database 532.

FIG. 20 is a flowchart of an exemplary process of classifying aquestion, according to an embodiment of the present teaching. A questionis initially received from a person at 2002. Further, one or morefeatures are extracted from the question at 2004. The person-centricfeatures are retrieved from person-centric knowledge database 532 at2006. A decision is made as to whether at least one feature extractedfrom the question matches a person-centric feature from person-centricknowledge database 532 at 2008. If at least one feature extracted fromthe question matches a person-centric feature from person-centricknowledge database 532, a decision is made that the question is apersonal question at 2010. If no features extracted from the questionmatches a person-centric feature from person-centric knowledge database532, a decision is made that the question is a non-personal question at2012.

FIG. 21 depicts an exemplary person-centric knowledge for classifying aquestion, according to an embodiment of the present teaching. Forexample, when a question “How to write a good resume?” is received, afeature of “resume” is extracted from the question. If no match is foundfrom person-centric knowledge database 532, the question is classifiedas non-personal. In another example, when a question “When is TaylorSwift's birthday?” is received, a feature of “Taylor Swift” and afeature of “birthday” are extracted from the question. If no match withrespect to the two extracted features is found from person-centricknowledge database 532, the question is classified as non-personal. Inyet another example, when a question “E-ticket for Taylor Swift'sconcert?” is received, “Taylor Swift,” “E-ticket,” and “concert” areextracted from the question. When “E-ticket” matches a person-centricfeature from person-centric knowledge database 532, the question isclassified as personal. The person may ask any types of questions, andthe examples described above are for illustrative purpose and are notintended to be limiting.

FIG. 22 is a flowchart of an exemplary process of classifying aquestion, according to another embodiment of the present teaching. Aquestion is received from a person at 2202. Further, a question patternis determined from the question at 2204. The personal question patternsare retrieved from person-centric knowledge database 532 at 2206. Adecision is made as to whether the question pattern matches a personalquestion pattern from person-centric knowledge database 532 at 2208. Ifthe question pattern matches a personal question pattern fromperson-centric knowledge database 532, a decision is made that thequestion is a personal question at 2210. If the question pattern doesnot match a personal question pattern from person-centric knowledgedatabase 532, a decision is made that the question is a non-personalquestion at 2212.

FIG. 23 depicts an exemplary person-centric knowledge for classifying aquestion, according to another embodiment of the present teaching. Forexample, when a question “How to write a good resume?” is received, aquestion pattern of using interrogative word “How” is determined. Thepersonal questions patterns retrieved from person-centric knowledgedatabase 532 does not have a pattern that uses interrogative word “How,”and therefore, the question is classified as non-personal. In anotherexample, when a question “When is Taylor Swift's birthday?” is received,a question pattern is determined to have a celebrity name. The personalquestions patterns retrieved from person-centric knowledge database 532do not consider the celebrity name as a personal question pattern, andtherefore, the question is classified as non-personal. In yet anotherexample, when a question “E-ticket for Taylor Swift's concert?” isreceived, a question pattern is determined to have celebrity name andpersonal event information which is defined as a personal questionpattern. Therefore, the question is determined as personal. The systemmay construct all types of personal question patterns. The examplesdescribed above are for illustrative purpose and are not intended to belimiting.

FIG. 24 depicts an exemplary system diagram of a machine learning systemfor training the person-centric knowledge for classifying a question,according to an embodiment of the present teaching. Classificationknowledge learning engine 1920 as illustrated in FIG. 19 comprises auser interaction collector 2402, a dwell-time analyzer 2404, asequential action analyzer 2406, a classification knowledge trainingmodule 2408, and an updating unit 2410. User interaction collector 2402is configured to communicate with user interface 502 to collect theperson's interactions with the presented answers. The presented answersinclude a plurality of person-centric answers and public answers, eachof which in a format of an answer card. Dwell-time analyzer 2404 isconfigured to analyze the time that the person spent on reviewing ananswer card, information related to the answer card, and/or a webpageredirected from the answer card. The longer the dwell-time, the moreinterest that the user may have on a particular answer card. Sequentialaction analyzer 2406 is configured to analyze a series of userinteractions sequentially occurred after reviewing an answer card. Ifthe sequentially occurred user interactions are closely related to aperson-centric answer card, a decision is made that the classificationof the question as a personal category is more likely to be appropriate.If the sequentially occurred user interactions appear random, a decisionis made that the classification of the question as a personal categorymay be inappropriate.

Classification knowledge training module 2408 is configured to receiveinformation related to the question classification from questionclassification archive 1706, searched results such as person-centricanswers and/or public answers from person-centric answer search engine1710 and public answer search engine 1714, dwell-time analysis resultfrom dwell-time analyzer 2404, and sequentially occurred userinteraction analysis result from sequential action analyzer 2406.Classification knowledge training module 2408 is further configured totrain the received data and generate classification knowledge as a basisfor further question classification. Classification knowledge trainingmodule 2408 may use computer programs such as machine learning system totrain the received data. Classification knowledge training module 2408may train the data based on the dwell-time analysis result or thesequential action analysis result alone. In some other embodiments,classification knowledge training module 2408 may combine the dwell-timeanalysis result and the sequential action analysis result to train thedata. Updating unit 2410 is configured to update person-centricknowledge database 532 with the newly-trained person-centric knowledge.It should be appreciated that the description above is for illustrativepurpose. Various characters related to the person interactions may beanalyzed to determine whether the user is satisfied with the presentedanswers, which in turn indicates whether the question classification isappropriate or not.

FIG. 25 is a flowchart of an exemplary process of a machine learningprocess for training the person-centric knowledge for classifying aquestion, according to an embodiment of the present teaching. Initially,a question is received from the user at 2502. The question is classifiedto a personal category or a non-personal category at 2504. If thequestion is classified as a non-personal question, answers are obtainedfrom all available resources at 2506. If the question is classified as apersonal question, answers are obtained from a person-centric knowledgedatabase at 2508. One or more answer cards with integrated answers arepresented to the user at 2510. Further, one or more user interactionswith the presented answers are collected at 2512. User feedback on thepresented answers based on the collected one or more user interactionsare analyzed at 2514. The question classification temporarily archivedis refined based on the analysis at 2516. Classification knowledge withrespect to the question is generated at 2518. The person-centricknowledge database is updated at 2520.

FIG. 26 is a flowchart of an exemplary process of a machine learningprocess for training the person-centric knowledge for classifying aquestion, according to another embodiment of the present teaching. Aquestion is initially classified as a personal category at 2602. Userdwell-time associated with each presented answer card is analyzed at2604. User sequential actions associated with each presented answer cardis analyzed at 2606. A decision is made as to whether the user issatisfied with the presented answers at 2608. If the person is satisfiedwith the presented answers, the question classification is maintained at2610, and the question is saved in the person-centric knowledge databaseas a personal category at 2612. If the user is unsatisfied with thepresented answers, the question is reclassified as a non-personalquestion at 2614 and is not saved in the person-centric knowledgedatabase.

FIG. 27 depicts an example of determining the classification accuracybased on two parameters, according to another embodiment of the presentteaching. As shown in the Table, the dwell-time and the sequentialaction are two parameters employed to determine whether a questionclassification is appropriate or not. When the person's dwell-time on aperson-centric answer card is long and the next action is to clickanother person-centric answer card, it is determined that theclassifying of the question as “personal” is highly appropriate. Inanother example, when the user's dwell-time on a person-centric answercard is moderate and the next action is to ask a new question, it isdetermined that the classifying of the question as “personal” is lessappropriate. The determination of whether a question classification isappropriate or not may be performed by one or more computer algorithms,for example, machine learning algorithm or fuzzy logic algorithm. Theparameters and computer algorithms used to determine whether a questionclassification is appropriate or not are not limited to the examples setforth above. The present teaching may use various other features relatedto the user interactions and apply different computer algorithms to makethe decision.

FIG. 28 depicts an exemplary system diagram for providing syntheticanswers to a personal question, according to an embodiment of thepresent teaching. The synthetic answers to a personal question aregenerated via person-centric answer search engine 1710, which comprisesan entity extractor 2802, an entity categorizer 2804, a relationextractor 2806, an exact match decision module 2808, a fuzzy matchdecision module 2810, and a personal question variation module 2812.After a question is classified as a personal question, entity extractor2802 extracts all entities from the question and entity categorizer 2804categorizes the extracted entities into different categories. Entityextractor 2802 may apply a machine learning sequence tagger to identifyall entities from the question; however, the methods of identifying theentities may include other computer algorithms. Entity extractor 2802and entity categorizer 2804 as illustrated are two separate components;however, the present teaching is not intended to be limiting. Entityextractor 2802 and entity categorizer 2804 may be integrated as onefunction component. Relation extractor 2806 is configured to extract oneor more relations between the identified entities from the question.Relation extractor 2806 may use a natural language parser to extract therelation information; however, relation extractor 2806 may use otherparsing techniques to obtain the relation information.

Exact match decision module 2808 is configured to determine whetherthere is an exact match between one or more of the extracted entitiesand the extracted relations based on information provided byperson-centric knowledge retriever 526 who retrieves such informationfrom person-centric knowledge database 532. When an exact match is notfound, fuzzy match decision module 2810 makes further decision as towhether an answer to a variation of the question exists inperson-centric knowledge database 532. One or more variations of thequestion are generated by personal question variation module 2812 basedon selected variation criteria. Results from exact match decision module2808 and fuzzy match decision module 2810 are sent to synthetic answerengine 1708. In some embodiments, synthetic answer engine 1708 generatesa plurality of person-centric answers based on the results from exactmatch decision module 2808 and fuzzy match decision module 2810. In someother embodiments, synthetic answer engine 1708 initiates a taskrequest, and sends the task request to task-driven answer search engine1606. Task-driven answer search engine 1606 searches for answers inaccordance with the received task request and provides the searchedanswers to synthetic answer engine 1708.

FIG. 29 depicts an exemplary system diagram for providing syntheticanswers to a personal question, according to another embodiment of thepresent teaching. According to the illustrated embodiment, aperson-centric entity retriever 2906 retrieves all person-centricentities associated with the user who asked the personal question fromperson-centric knowledge database 532; a person-centric relationretriever 2908 retrieves all person-centric relations associated withthe user who asked the personal question from person-centric knowledgedatabase 532; an entity comparing module 2902 compares the entitiesextracted from the question with all person-centric entities associatedwith the user and determines whether there is at least one exact match;a relation comparing module 2904 compares the relations extracted fromthe question with all person-centric relations associated with the userand determines whether there is at least one exact match; and a matchdecision unit 2910 determines whether there exists an entity-relationargument in person-centric knowledge database 532 based on thecomparison results. If there is an exact match with respect to oneentity and an exact match with respect to one relation associated withthe one entity, match decision unit 2910 determines that there exists anentity-relation argument in person-centric knowledge database 532. Insome embodiments, if there are two exact matches with respect to twoentities, match decision may also determine that there exists anentity-relation argument in person-centric knowledge database 532. Thecriteria to determine whether there exists an entity-relation argumentin person-centric knowledge database 532 are not limited to the examplesdescribed above, and may be based on other features extracted from thequestion and/or any combination of the features extracted from thequestion.

FIG. 30 depicts an exemplary system diagram for providing syntheticanswers to a personal question, according to another embodiment of thepresent teaching. When exact match decision module 2808 determines thatno exact match is found, fuzzy match decision module 2810 receives frompersonal question variation module 2812 one or more variations of theentities and relations extracted from the personal question, andperforms a fuzzy decision on whether an entity-relation argument existsin person-centric knowledge database 532. Personal question variationmodule 2812 comprises an entity variation module 3002, a relationvariation module 3004, an entity variation library 3006, and a relationvariation library 3008. An entity extracted from the question istransformed to one or more entity variations based on information fromentity variation library 3006, and a relation extracted from thequestion is transformed to one or more variations based on informationfrom relation variation library 3008. Variations of an entity may be anick name of the entity, an abbreviation of the entity, a relation withanother entity, an adjective of the entity, a synonym of the entity,another entity with similar meaning, etc. Variations of a relation maybe a linguistic format of the relation, an entity, a synonym of therelation, another entity and/or relation with similar meaning, etc. Thevariations of the entities and relations are initially collected fromInternet and are dynamically updated with respect to the question beingasked. In some embodiments, words embedded over a large corpus of usertexts are collected and trained to measure similarities between thesewords. Entity variation library 3006 and relation variation library 3008as illustrated are separate components; however, they may also beintegrated into one variation library. In some embodiments, entityvariation library 3006 and relation variation library 3008 may beincorporated into person-centric knowledge database 532. Each of theentity variation is compared with the person-centric entities todetermine whether there is a match, and each of the relation variationis compared with the person-centric relations to determine whether thereis a match. If there is a match with respect to one entity variation anda match with respect to one relation variation associated with the oneentity, fuzzy decision unit 3010 determines that there exists anentity-relation argument in person-centric knowledge database 532. Insome embodiments, if there are two matches with respect to thevariations of two entities, match decision may also determine that thereexists an entity-relation argument in person-centric knowledge database532. In some embodiments, after comparing all entity variations andrelation variations, fuzzy decision unit 3010 may determine that morethan one entity-relation argument exists in person-centric knowledgedatabase 532. The criteria to determine whether there exists anentity-relation argument in person-centric knowledge database 532 arenot limited to the examples described above, and may be based on otherfeatures extracted from the question and/or any combination of thefeatures extracted from the question.

FIG. 31 depicts an exemplary system diagram for generating syntheticanswers to a personal question, according to an embodiment of thepresent teaching. Synthetic answer engine 1708 receives inputs fromexact match decision module 2808 and fuzzy match decision module 2810.Synthetic answer engine 1708 determines whether a task request to searchfor up-to-date answers needs to be generated. The inputs from exactmatch decision module 2808 and fuzzy match decision module 2810 includeone or more entries, each of which having at least an entity and arelation associated with the entity, and the entity and the relation areboth found in person-centric knowledge database 532. The inputs may haveother formats of the matched entities and relations and are not limitedto the description above. If it is determined that there is a need tosearch for up-to-date answers to the personal question, synthetic answerengine 1708 sends the task request to task-driven answer search engine1606 to perform the task.

Synthetic answer engine 1708 comprises an entity-relation argumentretriever 3102, a relation validator 3104, a ranking module 3106, and ananswer content generator 3108. Entity-relation argument retriever 3102is configured to retrieve one or more arguments from person-centricknowledge database 532, each of which has a matched entity and a matchedrelation. Relation validator 3104 is configured to determine whether therelation between two entities is valid and/or up-to-date. As somerelation may be a long-term relation and last long, such as the relationbetween Joe, a regular Amazon shopper and Amazon, other relations may bea short-term or a temporary relation, such as the relation between Joeand FedEx. An argument associated with Joe and FedEx may be set asinvalid or out-of-date once FedEx completed a delivery. When a newquestion triggers extracting the argument of Joe-FedEx, relationvalidator 3104 sends a task request to task generation engine 520 tosearch for an up-to-date answer from the FedEx website if the Joe-FedExrelation is discovered as invalid or out-of-date. Task generation engine520 creates a list of tasks that need to be completed in order to searchthe up-to-date answer from the FedEx website. Task generation engine 520may use a template from task template database 538 or create a newtemplate. Task completion engine 522 further performs each of the listedtasks in order to obtain the up-to-date answer. In some embodiments, ifthe relation is determined to be valid, the one or more entity-relationarguments are sent to ranking module 3106 to prioritize the arguments.In some embodiment when there is a newly searched up-to-date answer, theone or more entity-relation arguments and the newly searched up-to-dateanswer are both sent to ranking module 3106 to prioritize. The newlysearched up-to-date answer is further sent to Q/A learning engine 1726to update person-centric knowledge database 532 and task templatedatabase 539 via updating module 1728. The functions of Q/A learningengine 1726 and updating module 1728 are described above, and thus, arenot detailed herein.

Answer content generator 3108 is configured to generate the contentrelated to the answer, which may include textual description of theanswer, information related to the answer, recommended information thatmay interest the user, etc. The answer content is further forwarded toanswer card generator 1712 to generate one or more answer cards. Thefunctions of answer card generator 1712 and dynamic card builder 538 aredescribed foregoing, and therefore, are not detailed herein.

FIG. 32 is a flowchart of an exemplary process for generating syntheticanswers to a personal question, according to an embodiment of thepresent teaching. One or more entities are extracted from the personalquestion at 3202. The one or more entities are further categorized intoa plurality of categories at 3204. Based on the extracted entities andthe categories, one or more relations between the one or morecategorized entities are extracted from the question at 3206. Meanwhile,a plurality of person-centric entities associated with the user areretrieved from the person-centric knowledge database at 3208, and aplurality of person-centric relations among the plurality ofperson-centric entities are retrieved from the person-centric knowledgedatabase at 3210. A decision as to whether a match is found betweenentities extracted from the question and the plurality of person-centricentities is made at 3212. If there is an entity match, a decision as towhether a match is found between relations extracted from the questionand the plurality of person-centric relations is made at 3214. If noentity match is found, a list of variations of the entity extracted fromthe question is generated at 3216. A further decision as to whether amatch is found between the entities variations from the question and theplurality of person-centric entities is made at 3218. If no relationmatch is found, a list of variations of the relations extracted from thequestion is generated at 3222. A further decision as to whether a matchis found between the relation variations from the question and theplurality of person-centric relations is made at 3224. Further, one ormore entity-relation arguments are retrieved from the person-centricknowledge database at 3220. A decision as to whether the relation isstill valid is made at 3226. If the relation is determined to be valid,answer content is generated from the entity-relation argument at 3228.If the relation is determined to be not valid, a list of tasks isgenerated to search answer content from all available resources at 3230.

FIG. 33 depicts an example of generating person-centric knowledge,according to an embodiment of the present teaching. According to theillustrated embodiment, person-centric knowledge 3306 is constructedbased on information collected from private space 104 and semi-privatespace 106. Feature extractor 3302 is configured to collect emailinformation 3320, calendar information 3322, social activity information3324, etc. associated with a user from private space 104 andsemi-private space 106. Feature categorizer 3304 is configured tocategorize the collected information into a plurality of categoriesincluding but not limited to a person's name 3308, a non-personal entity3310, time and/or date 3312, location 3314, relation 3316, event 3318,etc. The categorized features associated with the user are stored asperson-centric knowledge associated with the user. It should beappreciated that the examples to construct person-centric knowledge arefor illustrative purpose and are not intended to be limiting. Thepresent teaching may acquire other features from private space 104 andsemi-private space 106 to construct person-centric knowledge.

FIG. 34 depicts an exemplary person-centric knowledge for providingsynthetic answers to a personal question, according to an embodiment ofthe present teaching. In one example, four entities are extracted from apreviously asked question including Taylor Swift, Verizon Center,Concert, and Oct. 10, 2015, and one relation is extracted as Booke-ticket. Two arguments are created with respect to the extractedentities and relation. One argument states that Taylor Swift's concertis on Oct. 10, 2015, and another argument states that Taylor Swift'sconcert is at Verizon Center. In another example, three entities areextracted from a previously asked question including Uncle Sam, Annie,and 55, and one relation is extracted as wife. One argument states thatAnnie is Uncle Sam's wife, and another argument states that Annie is 55years old. The examples shown in FIG. 34 are for illustrative purpose;however, the present teaching is not intended to be limiting, and mayhave more or less entity categories and arguments in the person-centricknowledge.

FIG. 35 depicts an exemplary person-centric knowledge for providingsynthetic answers to a personal question, according to anotherembodiment of the present teaching. Person-centric knowledge 3306 maycomprise one or more variations of an entity, for example, VerizonCenter may has variations as Stadium, Center, and Washington D.C., andRecreation Center may has variations as Gym, Exercise, Swimming pool,and Racquet Court. The examples shown in FIG. 35 are for illustrativepurpose; however, the present teaching is not intended to be limiting,and may have more or less entity variations in the person-centricknowledge.

FIG. 36 depicts an exemplary person-centric knowledge for providingsynthetic answers to a personal question, according to anotherembodiment of the present teaching. Person-centric knowledge 3306 maycomprise one or more variations of a relation, for example, Book mayhave variations as Purchase, Order, Buy, and Reserve, and Call may havevariations as Conference, Meet, and Discuss. The examples shown in FIG.36 are for illustrative purpose; however, the present teaching is notintended to be limiting, and may have more or less relation variationsin the person-centric knowledge.

FIG. 37 depicts an example of obtaining synthetic answers to a personalquestion based on the person-centric knowledge, according to anembodiment of the present teaching. When a question of “How old is UncleSam's wife?” is asked, the system searches for possible matches betweenUncle Sam and Wife. An exact match between Uncle Sam and Wife is found,and therefore, answer content is obtained from two arguments that “Annieis Uncle Sam's wife;” and “Annie is 55 years old.” In another example, aquestion of “Taylor Swift's concert in a DC stadium?” is asked. Thesystem searches for possible matches between entities including TaylorSwift, Concert, DC, and Stadium. Exact matches are found with respect toTaylor Swift and Concert, while fuzzy matches are found with respect toDC and Stadium because DC and Stadium are listed as variations ofVerizon Center. Based on the results, answer content is obtained as“Taylor Swift's concert is in Verizon Center.” It should be appreciatedthat the examples shown in FIG. 37 are for illustrative purpose;however, the present teaching is not intended to be limiting, and mayhave other decision schemes using the person-centric knowledge.

FIG. 38 depicts a general mobile device architecture on which thepresent teaching can be implemented. In this example, the person'sdevice is a mobile device 1300, including but is not limited to, a smartphone, a tablet, a music player, a handled gaming console, a globalpositioning system (GPS) receiver, a smart-TV, wearable devices, etc.The mobile device 1300 in this example includes one or more centralprocessing units (CPUs) 3802, one or more graphic processing units(GPUs) 3804, a display 3806, a memory 3808, a communication platform3810, such as a wireless communication module, storage 3812, and one ormore input/output (I/O) devices 3814. Any other suitable component, suchas but not limited to a system bus or a controller (not shown), may alsobe included in the mobile device 3800. As shown in FIG. 38, mobileoperating system 3816, e.g., iOS, Android, Windows Phone, etc., and oneor more applications 3818 may be loaded into the memory 3808 from thestorage 3812 in order to be executed by the CPU 3802. The applications3818 may include a browser or any other suitable mobile apps for sendinga question and rendering the answer results page including through themobile device 3800. Execution of the applications 3818 may cause themobile device 3800 to perform the processing as described above in thepresent teaching. For example, presentation of an answer results pagemay be made by the GPU 3804 in conjunction with the display 3806. Aquestion may be inputted by the person via the I/O devices 3814 andtransmitted via the communication platform 3810.

To implement the present teaching, computer hardware platforms may beused as the hardware platform(s) for one or more of the elementsdescribed herein. The hardware elements, operating systems, andprogramming languages of such computers are conventional in nature, andit is presumed that those skilled in the art are adequately familiartherewith to adapt those technologies to implement the processingessentially as described herein. A computer with user interface elementsmay be used to implement a personal computer (PC) or other type of workstation or terminal device, although a computer may also act as a serverif appropriately programmed. It is believed that those skilled in theart are familiar with the structure, programming, and general operationof such computer equipment and as a result the drawings should beself-explanatory.

FIG. 39 depicts a general computer architecture on which the presentteaching can be implemented. The computer may be a general-purposecomputer or a special purpose computer. This computer 3900 can be usedto implement any components of the question answering architecture asdescribed herein. Different components of the system in the presentteaching can all be implemented on one or more computers such ascomputer 3900, via its hardware, software program, firmware, or acombination thereof. Although only one such computer is shown, forconvenience, the computer functions relating to the target metricidentification may be implemented in a distributed fashion on a numberof similar platforms, to distribute the processing load.

The computer 3900, for example, includes COM ports 3902 connected to andfrom a network connected thereto to facilitate data communications. Thecomputer 3900 also includes a central processing unit (CPU) 3904, in theform of one or more processors, for executing program instructions. Theexemplary computer platform includes an internal communication bus 3906,program storage and data storage of different forms, e.g., disk 3908,read only memory (ROM) 3910, or random access memory (RAM) 3912, forvarious data files to be processed and/or communicated by the computer,as well as possibly program instructions to be executed by the CPU. Thecomputer 3900 also includes an I/O component 3914, supportinginput/output flows between the computer and other components. Thecomputer 3900 may also receive programming and data via networkcommunications.

Hence, aspects of the method of question answering, as outlined above,may be embodied in programming. Program aspects of the technology may bethought of as “products” or “articles of manufacture” typically in theform of executable code and/or associated data that is carried on orembodied in a type of machine-readable medium. Tangible non-transitory“storage” type media include any or all of the memory or other storagefor the computers, processors or the like, or associated modulesthereof, such as various semiconductor memories, tape drives, diskdrives and the like, which may provide storage at any time for thesoftware programming.

All or portions of the software may at times be communicated through anetwork such as the Internet or various other telecommunicationnetworks. Such communications, for example, may enable loading of thesoftware from one computer or processor into another. Thus, another typeof media that may bear the software elements includes optical,electrical, and electromagnetic waves, such as used across physicalinterfaces between local devices, through wired and optical landlinenetworks and over various air-links. The physical elements that carrysuch waves, such as wired or wireless links, optical links or the like,also may be considered as media bearing the software. As used herein,unless restricted to tangible “storage” media, terms such as computer ormachine “readable medium” refer to any medium that participates inproviding instructions to a processor for execution.

Hence, a machine-readable medium may take many forms, including but notlimited to, a tangible storage medium, a carrier wave medium or physicaltransmission medium. Non-volatile storage media include, for example,optical or magnetic disks, such as any of the storage devices in anycomputer(s) or the like, which may be used to implement the system orany of its components as shown in the drawings. Volatile storage mediainclude dynamic memory, such as a main memory of such a computerplatform. Tangible transmission media include coaxial cables; copperwire and fiber optics, including the wires that form a bus within acomputer system. Carrier-wave transmission media can take the form ofelectric or electromagnetic signals, or acoustic or light waves such asthose generated during radio frequency (RF) and infrared (IR) datacommunications. Common forms of computer-readable media thereforeinclude for example: a floppy disk, a flexible disk, hard disk, magnetictape, any other magnetic medium, a CD-ROM, DVD or DVD-ROM, any otheroptical medium, punch cards paper tape, any other physical storagemedium with patterns of holes, a RAM, a PROM and EPROM, a FLASH-EPROM,any other memory chip or cartridge, a carrier wave transporting data orinstructions, cables or links transporting such a carrier wave, or anyother medium from which a computer can read programming code and/ordata. Many of these forms of computer readable media may be involved incarrying one or more sequences of one or more instructions to aprocessor for execution.

Those skilled in the art will recognize that the present teachings areamenable to a variety of modifications and/or enhancements. For example,although the implementation of various components described above may beembodied in a hardware device, it can also be implemented as a softwareonly solution. In addition, the components of the system as disclosedherein can be implemented as a firmware, firmware/software combination,firmware/hardware combination, or a hardware/firmware/softwarecombination.

While the foregoing has described what are considered to be the bestmode and/or other examples, it is appreciated that various modificationsmay be made therein and that the subject matter disclosed herein may beimplemented in various forms and examples, and that the teachings may beapplied in numerous applications, only some of which have been describedherein. It is intended by the following claims to claim any and allapplications, modifications and variations that fall within the truescope of the present teachings.

We claim:
 1. A method, implemented on at least one computing device,each of which has at least one processor, storage, and a communicationplatform connected to a network for validating a classification of aquestion, the method comprising: obtaining the classification of thequestion posed by a user; presenting to the user, one or more answercards based on the classification of the question; analyzing the user'sinteraction with respect to at least one of the one or more answercards, wherein the user's interaction includes at least one of a dwelltime of the user and an action performed by the user with respect to theat least one of the one or more answer cards; and validating theclassification of the question based on the user's interaction.
 2. Themethod of claim 1, wherein the question is classified as one of apersonal question and a non-personal question.
 3. The method of claim 1,wherein the dwell time of the user corresponds to an amount of timespent by the user analyzing the at least one answer card, and the actionperformed by the user is one of clicking another answer card presentedto the user and generating a new question.
 4. The method of claim 1,wherein the step of validating the classification of the questionfurther comprising: maintaining the classification of the question inresponse to the dwell time being above a threshold time and the actioncorresponding to the user clicking on a first type of answer card; andmodifying the classification of the question in response to the dwelltime being below the threshold time and the action corresponding to theuser generating another question.
 5. The method of claim 1, wherein theone or more answer cards is selected from a data source that isdetermined based on the classification of the question.
 6. The method ofclaim 1, wherein the question is classified based on a pattern of thequestion and a feature extracted from the question.
 7. A system havingat least one processor, storage, and a communication platform connectedto a network for validating a classification of a question, the systemcomprising: a classification retriever implemented by the at least oneprocessor and configured to obtain the classification of the questionposed by a user; an answer-card generator implemented by the at leastone processor and configured to present to the user, one or more answercards based on the classification of the question; a user interactionmodule implemented by the at least one processor and configured toanalyze the user's interaction with respect to at least one of the oneor more answer cards, wherein the user's interaction includes at leastone of a dwell time of the user and an action performed by the user withrespect to the at least one of the one or more answer cards; and aclassification decision module implemented by the at least one processorand configured to validate the classification of the question based onthe user's interaction.
 8. The system of claim 7, wherein the questionis classified as one of a personal question and a non-personal question.9. The system of claim 7, wherein the dwell time of the user correspondsto an amount of time spent by the user analyzing the at least one answercard, and the action performed by the user is one of clicking anotheranswer card presented to the user and generating a new question.
 10. Thesystem of claim 7, wherein the classification decision module is furtherconfigured to: maintain the classification of the question in responseto the dwell time being above a threshold time and the actioncorresponding to the user clicking on a first type of answer card; andmodify the classification of the question in response to the dwell timebeing below the threshold time and the action corresponding to the usergenerating another question.
 11. The system of claim 7, wherein the oneor more answer cards is selected from a data source that is determinedbased on the classification of the question.
 12. The system of claim 7,wherein the question is classified based on a pattern of the questionand a feature extracted from the question.
 13. A non-transitorymachine-readable medium having information recorded thereon forvalidating a classification of a question, wherein the information, whenread by the machine, causes the machine to perform the following:obtaining the classification of the question posed by a user; presentingto the user, one or more answer cards based on the classification of thequestion; analyzing the user's interaction with respect to at least oneof the one or more answer cards, wherein the user's interaction includesat least one of a dwell time of the user and an action performed by theuser with respect to the at least one of the one or more answer cards;and validating the classification of the question based on the user'sinteraction.
 14. The non-transitory machine-readable medium of claim 13,wherein the question is classified as one of a personal question and anon-personal question.
 15. The non-transitory machine-readable medium ofclaim 13, wherein the dwell time of the user corresponds to an amount oftime spent by the user analyzing the at least one answer card, and theaction performed by the user is one of clicking another answer cardpresented to the user and generating a new question.
 16. Thenon-transitory machine-readable medium of claim 13, wherein the step ofvalidating the classification of the question further comprising:maintaining the classification of the question in response to the dwelltime being above a threshold time and the action corresponding to theuser clicking on a first type of answer card; and modifying theclassification of the question in response to the dwell time being belowthe threshold time and the action corresponding to the user generatinganother question.
 17. The non-transitory machine-readable medium ofclaim 13, wherein the one or more answer cards is selected from a datasource that is determined based on the classification of the question.